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SUMMARY 


This  test  manual  provides  information  on  the  use  and  interpretation  of 
scores  derived  from  the  Air  Force  Officer  Qualifying  Test  (AFOQT)  Form  P. 
Since  1951,  the  AFOQT  has  been  part  of  the  selection  process  for  officer 
commissioning  programs  and  pilot  and  navigator  training.  The  latest  cf  16 
successive  forms  of  the  AFOQT,  Form  P,  is  the  first  to  be  developed  in  two 
parallel  forms. 

The  manual  briefly  describes  the  evolution  of  the  AFOQT  across  its 
periodic  revisions  designed  to  improve  predictive  validity  and  prevent  test 
compromise.  A  description  of  each  subtest  and  composite  is  provided,  and 
sample  problems  are  presented. 

Fvpf>riment;al  AFOQT  items  were  constructed  and  field  tested  with  the  goal 
ot  providing  a  large  AFOQT  item  bank.  Analyses  of  the  data  provided 
information  important  for  Form  P  item  selection.  Form  P,  like  Form  0, 
contains  380  items  organized  into  16  subtests  which  form  five  composites: 
Pilot,  Navigator-Technical,  Academic  Aptitude,  Verbal,  and  Quantitative. 
Resemblance  to  Form  0  in  terms  of  psychometric  properties,  content,  and 
style  was  the  criterion  for  Form  P  item  selection.  Preliminary  analyses  of 
Forms  PI  and  P2  based  on  the  test  data  of  airmen  attending  Basic  Military 
Training  at  Lackland  AFB  established  that  the  forms  are  highly  similar  to 
each  other,  as  well  as  being  comparable  to  Form  0. 

Additional  evidence  of  the  comparability  of  the  forms  is  given  in  some 
detail  for  officer  applicant  samples  tested  during  the  Initial  Operational 

Test  and  Evaluation  (I0T&E)  of  AFOQT  Form  P.  Central  characteristics, 
reliability  (KR-20),  standard  error  of  measurement,  item  difficulty  and 
discrimination,  and  item  content  are  reported.  Intercorrelations  among  the 
Form  P  subtests  are  interpreted  for  construct  validity.  Included  are 
summaries  of  criterion-related  validity  studies  relating  AFOQT  Form  0  to 
performance  measures  for  officer  commissioning  programs  at  Officer  Training 
School  (OTS)  and  the  Air  Force  Reserve  Officer  Training  Corps  (AFROTC); 
entry-level  pilot  and  navigator  training;  and  non-rated  officer  training. 
Statistically  significant  positive  validity  coefficients  were  obtained  in 
all  of  the  studies.  Validity  studies  for  Form  P  are  planned,  and  it  is 
expected  that  the  predictive  validity  of  Form  P  will  be  similar  to  that  tor 
Form  0  because  of  the  comparability  of  Forms  0  and  P. 


Test  administration  and  scoring  considerations  are  discussed.  Topics 
include  testing  locations,  standardized  procedures,  test  security,  testing 


schedule,  and  score  interpretation.  Research  on  a  Quick  Score  procedure  to 


prescreen  applicants  for  OTS  and  AFROTC  is  reviewed.  Appropriate 
for  the  use  and  interpretation  of  the  AFOQT  are  cited. 
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PREFACE 


The  Air  Force  Human  Resources  Laboratory  (AFHRL)  is  designated  as  the 
primary  test  development  agency  for  the  Air  Force  Officer  Qualifying  Test 
(AFOQT)  by  Air  Force  Regulation  35-8,  Air  Force  Military  Personnel  Testing 
System.  The  current  research  and  development  (R&D)  effort  was  undertaken  as 
part  of  AFHRL's  respons ibi 1 i ty  to  develop,  revise,  and  conduct  research  in 
support  of  the  AFOQT.  Work  was  accomplished  under  Task  771918,  Selection 
and  Classification  Technologies,  which  is  part  of  a  larger  effort  in  Force 
Acquisition  and  Distribution  Systems.  The  project  was  completed  under  Work 
Unit  77191824,  Officer  Item  Pool  Development,  as  a  contractual  effort 
(Contract  F3361 5-83-C-003 5 )  by  Psychometrics  Inc.,  Sherman  Oaks,  CA. 

Ms  Jacobina  Skinner  represented  the  monitoring  agency  as  contract  manager. 


TABLE  OF  CONTENTS 


Page 

I.  BACKGROUND .  1 

History  of  the  AFOQT .  1 

Description  of  the  Subtests . .  4 

Composites . 10 

II.  DEVELOPMENT  OF  FORM  P . H 

Experimental  Item  Construction  .  11 

Analysis  and  Results . 12 

Parallel  Forms  Development  .  13 

III.  STATISTICAL  ANALYSIS  .  15 

Comparability  of  Forms  PI  and  P2 . 15 

Score  Characteristics . 15 

Item  Characteristics . 18 

Reliability  of  Subtests  and  Composites  .  19 

Standardization  .  21 

Validity . . . . . 23 

IV.  ADMINISTRATION  AND  SCORING . J() 

Administration  .  30 

Scoring . . . 32 

V.  SUPPLEMENTAL  SOURCES  OF  AFOQT  INFORMATION  .  35 

REFERENCES . 3  7 

APPENDIX  A:  Content  of  Forms  M,  N,  and  0  of  the  AFOQT . 41 

APPENDIX  B:  United  States  Air  Force  Officer  Qualifying  Test 

Information  Pamphlet:  Subtest  Sample  Items  .  45 

APPENDIX  C:  Skew  and  Kurtosis  of  AFOQT  Form  P  Subtest  and 

Composite  Raw  Scores . 77 

APPENDIX  D:  Item  Analysis  Summary  . . . . 79 

APPENDIX  E:  AFOQT  Subtest  Intercorrelation  Matrices  for 

Forms  PI  and  P2 . 87 

APPENDIX  F:  Raw  Score  to  Percentile  Conversion  Tables  .  91 

APPENDIX  G:  AFOQT  Bibliography  .  103 

iii 


LIST  OF  TABLES 


Table  Page 

1  Organization  of  AFOQT  Form  P  Subtests  and  Composites  .  2 

2  Me^n  Item  Difficulty  (£)  and  Discrimination  Statistics  (r^y) 

for  Experimental  Items  Selected  for  Forms  PI  and  P2 .  14 

I  Score  Means  and  Standard  Deviations  of  Common  Items 

on  AFOQT  Forms  PI  and  P2 .  16 

4  Means  and  Standard  Deviations  of  AFOQT  Forms  PI  and  P2 

Subtest  and  Composite  Raw  Scores  .  17 

5  Internal  Consistency  Reliability  (KR-20)  and 

Standard  Error  of  Measurement  (SEM)  .  20 

6  Composite  Reliabilities  and  Standard  Errors  of  Measurement 

(SEM)  Estimated  From  Subtest  KR-20  and  Communality  Values  ....  20 

7  Intercorrelations  of  Composite  Raw  Scores  for 

AFOQT  Forms  PI  and  P2 .  22 

8  Correlations  of  AFOQT  Form  0  Composites  with  Civilian  Tests  ...  29 

9  Testing  Schedule  .....  .  31 

A-l  Content  of  AFOQT  Form  M  (1975) .  42 

A-2  Content  of  AFOQT  Form  N  (1978) .  43 

A-3  Content  of  AFOQT  Form  0  (1981) .  44 

C-l  Skew  and  Kurtosis  of  AFOQT  Form  P  Subtest  and 

Composite  Raw  Scores .  78 

D- 1  Mean  Difficulties,  Biserials,  and  Point  Biserials,  AFOQT  Form  PI  80 

0-2  Distribution  of  Item  Difficulty,  AFOQT  Form  PI .  81 

D-3  Distribution  of  Item  Discrimination,  AFOQT  Form  PI .  82 

0-4  Mean  Difficulties,  Biserials,  and  Point  Biserials,  AFOQT  Form  P2  83 

D-5  Distribution  of  Item  Difficulty,  AFOQT  Form  P2 .  84 

D-6  Distribution  of  Item  Discrimination,  AFOQT  Form  P2 .  85 

E-l  Subtest  Raw  Score  Intercorrelations,  AFOQT  Form  PI  .  88 

c-2  Subtest  Raw  Score  Intercorrelations,  AFOQT  Form  P2  .  89 

F- 1  AFOQT  Form  PI  Pilot  Composite  Conversion  Table  .  92 

F-2  AFOQT  Form  Pi  Navigator-Technical  Composite  Conversion  Table  .  .  93 

F-3  AFOQT  Form  Pi  Academic  Aptitude  Composite  Conversion  Table  ...  94 

F-4  AFOQT  Form  Pi  Verbal  Composite  Conversion  Table  .  95 

F-5  AFOQT  Form  Pi  Quantitative  Composite  Conversion  Table  .  96 

F-6  AFOQT  Form  P2  Pilot  Composite  Conversion  Table  .  97 

F-7  AFOQT  Form  P2  Navigator-Technical  Composite  Conversion  Table  .  .  98 

F-8  AFOQT  Form  P2  Academic  Aptitude  Composite  Conversion  Table  ...  99 

F-9  AFOQT  Form  P2  Verbal  Composite  Conversion  Table  .  100 

F-10  AFOQT  Form  P2  Quantitative  Composite  Conversion  Table  .  101 


IV 


AIR  FORCE  OFFICER  QUALIFYING  TEST  (AFOQT)  FORM  P: 

TEST  MANUAL 
I.  BACKGROUND 

Since  1951,  when  the  test  appeared  in  a  preliminary  version,  the  Air 
Force  Officer  Qualifying  Test  (AFOQT)  has  been  part  of  the  selection  process 
for  Air  Force  officer  commissioning  programs  and  for  pilot  and  navigator 
training  programs.  The  latest  of  16  successive  forms  of  the  test,  Form  P, 
was  the  first  to  be  constructed  in  two  parallel  forms.  The  development, 
measurement  characteristics,  and  equivalence  of  these  forms,  and  the  use  and 
interpretation  of  their  scores,  are  the  principal  topics  of  this  manual. 

History  of  the  AFOQT 


Purpose 

As  with  the  other  military  services,  the  U.S.  Air  Force  has  the  goal  of 
commissioning  officers  who  will  be  leaders  with  exceptional  levels  of 
ability.  The  primary  selection  criteria  for  officers  are  education  and 
aptitude  (Brown,  1987).  The.  AFOQT  is  used  to  measure  the  aptitudes  of 
candidates  for  two  of  the  three  officer  commissioning  programs:  the  Air 
Force  Reserve  Officer  Training  Corps  (AFROTC)  and  the  Officer  Training 
School  (OTS).  (The  third  program,  which  is  offered  at  the  Air  Force  Academy 
in  Colorado  Springs,  Colorado,  uses  other  criteria  for  selection  and  is 
exempt  from  the  AFOQT  requirement.)  Aircrew  classification,  a  function  that 
matches  aptitudes  to  the  task  requirements  of  pilots  and  navigators, 
constitutes  the  second  major  use  of  the  AFOQT. 

AFOQT  Form  P  contains  380  items  organized  into  16  subtests  which  form 
five  composites:  Pilot,  Navigator-Technical,  Academic  Aptitude,  Verbal,  ana 
Quantitative  (Table  1).  A  candidate  for  admission  to  AFROTC,  OTS, 
Undergraduate  Pilot  Training  (UPT),  or  Undergraduate  Navigator  Training 
(UNT)  must  meet  or  exceed  minimum  qualifying  scores  on  the  AFOQT  composites 
to  continue  in  the  selection  process.  The  validity  of  the  procedure  has 
undergone  continual  study  and  verification  to  provide  assurance  that  the 
AFOQT  is  a  highly  suitable  instrument  for  officer  selection  and 
c lass i f  icat ion . 

Evolution  of  the  AFOQT 


The  information  in  this  section  is  drawn  largely  from  Rogers,  Roach,  and 
Short  (1986),  who  noted  that  age,  education,  and  medical  fitness 
requirements  were  the  principal  considerations  for  selection  to  pilot 
training  programs  before  World  War  II.  Early  efforts  to  use  physiological 
or  psychological  tests  to  select  student  pilots  were  only  partially 
successful.  Following  the  onset  of  U.S.  participation  in  World  War  II,  the 
Aviation  Psychology  Program  was  initiated,  with  the  mission  of  developing 
pilot  selection  instruments.  Among  the  prominent  psychologists  and 
measurement  specialists  staffing  this  program  were  John  C.  Flanagan,  J.  P. 
Guilford,  Robert  L.  Thorndike,  and  Nea L  E.  Miller.  The  need  for  a  large 
number  of  aircrew  personnel,  as  well  as  the  shortage  of  physically  qualified 
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Table  L.  Organization  of  AFOQT  Form  P  Subtests  and  Composites 


S  u  b  t  e  s  t 

No .  of 
items 

Composite 

Pilot 

Na  v-Te  ch 

Academic 
aptitude  Ve 

rba  1 

Quanti¬ 

tative 

Verbal  Analogies 

25 

X 

X 

X 

Arithmetic  Reasoning 

25 

X 

X 

X 

Reading  Comprehension 

25 

X 

X 

Data  Interpretation 

25 

X 

X 

X 

Word  Knowledge 

25 

X 

X 

Math  Knowledge 

25 

X 

X 

X 

Mechanical  Comprehension 

20 

X 

X 

Electrical  Maze 

20 

X 

X 

Scale  Reading 

40 

X 

X 

Instrument  Comprehension 

20 

X 

Block  Counting 

20 

X 

X 

Table  Reading 

40 

X 

X 

Aviation  Information 

20 

X 

Rotated  Blocks 

15 

X 

General  Science 

20 

X 

Hidden  Figures 

15 

X 

Total 

380 

college  students  in  1941,  dictated  their  first  task--to  develop  a  general 
abilities  test  to  replace  the  previously  used  college  requirement.  The  result 
was  the  Aviation  Cadet  Qualifying  Examination  (ACQE). 

Initial  Screening  Tests.  The  ACQE,  a  general  intelligence  test  consisting 
o  l  l  i  terns ,  Wao  idK^u  oy  more  than  a  mi-iiiun  tuon  duixiig  the  war  to  qualify 
as  commiss  ioned  officers  and  pilots,  bombardiers,  and  navigators. 

The  title  of  this  screening  test  was  changed  to  the  Army  Air  Forces  Qualifying 
Examination  (AAFQE)  in  June  1944,  due  to  the  fact  that  the  test  was  by  then 
neing  administered  m  enlisted  men  (to  serve  as  gunners)  as  well  as  to 
aviation  cadets. 

Selection  and  Classification  Tests.  The  need  for  a  test  to  screen  and 
classify  potential  aircrew  members  resulted  in  the  development  of  the  Aircrew 
Classification  Battery  (ACB).  This  battery  included  psychomotor  tests  and 
both  power  and  speeded  pape r-and-penc i 1  tests  that  provided  Pilot,  Navigator, 
and  Bombardier  stanines  (composite  scores).  The  ACB  was  revised  many  times 
and  was  eventually  used  for  selection  as  well  as  classification. 

The  ACB  was  discontinued  in  1947,  because  of  a  decrease  in  applicant  flow, 
but  was  reinstated  in  1951  at  the  request  of  Headquarters  Air  Training 
Command,  which  was  finding  it  difficult  to  find  aviation  cadet  applicants  who 
had  2  years  of  college.  In  the  interim,  the  Aviation-Cadet  Officer-Candidate 
Qualifying  Test  (AC-OC-QT)  was  used  experimentally  to  screen  applicants  for 
Officer  Candidate  School  (OCS)  (now  OTS)  and  for  direct  commissioning,  to 
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screen  aviation  cadet  applicants,  and  to  measure  necessary  nontlying  aptitudes 
such  as  electronics.  The  AC-OC-QT  became  operational  in  October  1950  under 
the  name  Aviation  Cadet  Qualifying  Test  (ACQT).  The  ACQT,  because  ol  its 
ability  to  predict  success  on  the  ACB,  replaced  the  ACQE  as  a  screening  device. 

The  AFOQT.  The  first  version  of  the  AFOQT,  which  incorporated  the 
AC-OC-QT,  was  designed  in  1951  to  predict  success  in  OCS  and  to  screen  for 
aircrew  training.  Form  A,  which  consisted  of  the  USAF  Officer  Activity 
Inventory,  the  Attitude  Survey,  and  the  preliminary  AFOQT,  followed.  Five 
composite  scores  were  derived  from  Form  A:  Offii_ei  Quality, 

Observer-Technical,  Pilot,  Verbal,  and  Quantitative.  Form  B,  designated  the 
Officer  Selection  Form,  became  operational  in  September  1955,  replacing  the 
ACB  for  aircrew  selection.  The  AFOQT  forms  that  followed  (Forms  C  through  L) 
underwent  frequent  revisions  to  composites,  subtest  content  areas,  test  items, 
conversion  tables,  and  norms. 

Precursors  of  Form  P.  The  history  of  the  development  ot  Forms  M,  N,  and  0 
shows  the  way  in  which  the  design  of  Form  P  evolved.  Appendix  A  displays  the 
content  and  organization  of  these  forms.  Form  M  (1975)  (Table  A-l)  was  the 
same  as  its  predecessor,  Form  L,  with  the  exception  that  it  provided  a 
separate  female-specific  conversion  table  for  the  Pilot  composite  (Miller, 
1974).  A  comparison  of  Tables  A-l  and  A-2  indicates  the  structural  changes 
introduced  into  Form  N.  The  need  for  revision  of  Form  M  was  revealed  by  a 
validity  study  conducted  by  Valentine  (1977).  The  five  booklets  of  Form  M 
were  reduced  to  four.  Five  subtests  were  removed:  Officer  Biographical 
Inventory,  Pilot  Biographical  Inventory,  Aviation  Information,  Visualization 
of  Maneuvers,  and  Stick  and  Rudder  Orientation.  Seven  subtests  were  added: 
Background  for  Current  Events,  Pilot  Biographic  and  Attitude  Scale,  Table 
Reading,  Electrical  Maze,  Block  Counting,  Tools,  and  Rotated  Blocks.  Further, 
the  Quantitative  Aptitude  and  Verbal  Aptitude  composites  in  Booklets  1  and  2 
were  subdivided  into  subtests.  Three  subtests — Arithmetic  Reasoning,  Math 
Knowledge,  and  Data  Interpretation--replaced  Quantitative  Aptitude.  Word 
Knowledge,  Reading  Comprehension,  Background  for  Current  Events,  and  Verbal 
Analogies  replaced  Verbal  Aptitude. 

Changes  were  also  made  to  the  normative  base  for  Form  N  (Gould,  1978). 

The  norming  sample,  drawn  from  the  three  commissioning  sources  was  well  as 
active  duty  second  lieutenants,  was  designed  to  reflect  the  range  of  ability 
°xpected  in  the  officer  applicant  (.cpulaMon .  Norms  were  developed  for  the 
total  group  and  for  three  educat iona 1-le ve 1  groups:  less  than  2  years  of 
college,  more  than  2  years  (but  not  graduates),  and  college  graduates.  New 
conversion  tables  were  developed  for  the  three  educational-level  groups. 

Form  0  (1981)  dropped  the  subtests  titled  Background  for  Current  Events, 
Tools,  Aerial  Landmarks,  and  Pilot  Biographic  and  Attitude  Scale,  and  added 
Aviation  Information  and  Hidden  Figures  (Rogers,  Roach,  &  Wegner,  1986).  lorm 
0  was  the  first  AFOQT  form  to  be  equated  to  an  anchor  test.  Form  N,  through 
items  common  to  both  forms.  A  further  change  involved  the  number  of 
conversion  tables,  which  were  reduced  to  five — one  for  each  composite--as  it 
was  no  longer  considered  relevant  to  emphasize  educational  differences  in  the 
conversions  (Roach,  1986;  Roach  &  Rogers,  1982).  Form  0  was  also  associated 
with  administrative  changes  that  promoted  efficiency  in  test  administration 
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and  ri'iluci'i1  clinical  problems  in  achieving  scoring  accuracy.  That  is,  all 
subtests  ■  re  enclosed  in  a  single  booklet;  all  answer  sheets  were 
e  1  ec t r on  1 ca 1  l  v  scanned  and  computer-scored;  and  there  were  two  central  scoring 
locations,  Maxwell  AFB,  Alabama,  lor  AFRUTC  test  administrations,  and  brooks 
\FB,  Texas,  tor  all  others. 

One  of  the  most  salient  features  ot  the  later  AFOQ'l  forms  is  their 
continuitv,  maintained  by  t he  use  ot  "anchor"  or  common  iLems.  Form  P,  iike 
For. n  o ,  has  a  certain  proportion  ot  items  taken  t  rom  t  lie  previous  version. 

I  lie  essential  nature  of  the  aptitude  measures  found  by  the  Air  Force  to  be 
related  to  proticient  oflicer  performance  remained  unchanged  in  AFOQT  Form  P. 

Description  ot  the  Subtests 

"Aptitude"  is  usually  interpreted  to  mean  inherent  or  natural  ability  and 
"aptitude  test,"  a  measure  ot  the  ability  to  learn.  In  practice,  aptitude 
tests  frequently  measure  combinations  ot  potential  abilities  and  learned 
skills.  The  Math  knowledge  and  Arithmetic  Reasoning  tests  in  the  AFOQT,  tor 
example ,  resemble  achievement  tests  and  assume  some  prior  knowledge.  The 
AFOQT  cun  be  seen  as  an  aptitude  test  battery,  however,  in  that  the  prediction 
of  tuture  pe  r  t  o  rma  nc  e  —  bo t  h  in  training  and  on  the  job--is  its  primary 
purpose.  In  addition,  the  AFOQT  serves  a  classification  function  in  the  Air 
Force  personnel  system  by  distinguishing  between  such  assignments  as  pilot 
training  and  navigator  training  (Miller,  1969a).  The  AFOQT  subtests  and 
composites,  and  the  abilities  they  measure,  are  described  below.  Sample 
problems,  taken  trom  the  Air  Force  Ofticer  Qualifying  Test  Information 
Pamphlet  (Department  ot  the  Air  Force,  19b7)  are  provided  in  Appendix  B.  Die 
sample  problems  are  provided  for  illustrative  purposes  only  and  should  not  be 
viewed  in  and  of  themselves  as  a  blueprint  tor  the  test.  They  reflect  the 
difricultv  and  content  of  AFOQT  items  but  do  not  necessarily  conform  to  all  ol 
the  subtest  specifications  in  the  descriptions  which  follow. 

Verbal  Analogies  ( VAT 


Trie  items  in  this  subtest  take  two  forms.  'Ine  first  is  a  pair  of  words 
joined  by  the  phrase  "is  to"  to  express  a  relationship  between  them,  followed 
by  a  third  word,  the  stimulus.  The  examinee  must  choose  from  a  list  ot  five 
words  the  word  that  has  the  same  relationship  to  the  stimulus  as  that  between 
the  given  pair.  The  second  form  displays  in  the  stem  only  a  pair  ot  words 
with  an  implied  relationship.  Among  five  alternative  word  pairs  is  one  that 
expresses  the  same  type  ot  relationship  as  that  of  the  given  pair. 

The  relationships  between  words  are  of  several  types,  including  cause  and 
effect,  sequence,  antonym,  synonym,  degree,  action  to  object,  object  to 
action,  part  to  part,  part  to  whole,  member  to  category,  and  object  to 
attribute.  Among  the  types  ot  wrong  alternatives  may  be  homonyms  or  antonyms 
of  the  correct  answer;  wrong  tenses  or  grammatical  torms;  or  reversals  ot  the 
direction  ot  the  given  analogy.  To  preclude  ambiguity,  the  alternatives  are 
constructed  such  that  only  one  type  o t  relationship  in  a  given  word  pair  can 
be  interred. 
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(An  example  of  ambiguity:  Morning  is  to  evening  as 

A.  breakfast  is  to  dinner 

B.  sunlight  is  to  starlight 

(Hither  alternative  would  be  an  appropriate  analogy.) 

Verbal  ability  (i.e.,  proper  use  of  vocabulary  in  various  contexts)  is  one 
component  of  the  ability  required  by  the  VA  test.  Inferring  relationships  is 
another  component.  The  degree  of  remoteness  of  these  relationships 
contributes  to  the  difficulty  levels  of  the  VA  items.  The  more  inferences 
required,  the  more  difficult  the  item.  For  example,  the  relationship 

"eras s : green"  requires  the  examinee  to  make  fewer  inferences  than 
"grass: landscape."  The  less  central  the  attributes  of  the  analogy  are  to  its 
semantic  representation,  the  more  difficult  the  i  am .  For  example,  the 
relationship  "grass :green"  is  probably  more  central  to  our  conceptual 
representation  of  grass  than  the  relationship  "grass : landscape . "  In  general, 

the  "X:Y::A: _ "  format  is  easier  than  the  "X:Y::  :  "  format,  in  that  tne 

former  directs  the  search  for  alternatives. 

VA  items  do  not  test  knowledge  oi  specific  subject  matters.  The 
vocabulary  used  is  neither  complex  nor  esoteric. 

Arithmetic  Reasoning  (AK) 

The  AR  items  test  the  ability  to  understand  and  manipulate  re Lat ionsnips 
m  order  to  arrive  at  solutions  to  problems.  The  major  emphasis  is  on 
reasoning,  but  the  problems  require  knowledge  of  basic  arithmetic  functions 
such  as  addition,  subtraction,  multiplication,  division,  percentages,  ratios, 
proportions,  squares,  square  roots,  or  the  conversion  of  dimensions  in  a 
single  measurement  system  (e.g.,  yards  to  feet).  Knowledge  of  the  formulas 
used  to  calculate  the  lengths  of  the  sides  of  right  triangles  or  the 
circumferences  and  areas  of  circles  may  be  necessary  to  solve  some  problems. 

In  order  to  avoid  irrelevant  sources  of  difficulty,  calculations  are  simple 
and  not  time-consuming. 

The  AR  items  are  constructed  as  word  problems.  In  general,  a 
characteristic  such  as  height,  weight,  speed,  distance  travelled,  temperature, 
or  interest  earned  must  be  calculated  in  the  context  of  other  characteristics 
that  have  a  given  relationship  (e.g.,  half  as  fast)  to  a  given  object. 
Calculations  may  involve  the  conversions  of  dimensions  in  a  scale,  a 
requirement  implied  by  the  problem.  For  example,  if  told  that  a  line  is  20 
yards  long  and  asked  how  many  3-foot-long  objects  fit  on  that  line,  the 
examinee  must  infer  that  conversion  from  yards  to  feet  is  necessary.  Verbal 
ability  as  an  intervening  variable  is  minimized  by  use  of  basic,  simple 
vocabulary  in  the  item  stems. 

Knowledge  of  specific  subjects  is  not  necessary  for  the  AR  items. 

Measured  bv  this  subtest  are  the  abilities  to  analyze  problems  and  use 
reasoning  to  manipulate  the  components  analyzed,  in  order  to  satisfy  the 
problem  specifications. 


Reading  Comprehension  (KC) 


KC  items  require  the  examinee  to  read  and  understand  paragraphs.  Single 
reading  passages  ranging  from  approximately  40  to  150  words  in  length  are 
followed  by  five  alternatives  each,  one  of  which  completes  the  final  sevtence 
of  the  passage  appropriately.  The  incorrect  alternatives  may  complete  the 
last  sentence  in  a  grammatical  sense,  but  they  show  either  an  incomplete  or 
incorrect  understanding  of  the  given  passage.  The  subject  matters  of  the 
reading  passages  include  art,  economics,  the  environment,  literature, 
philosophy,  psychology,  science,  and  sociology.  Knowledge  of  these  subjects 
is  not  necessary  for  the  examinee  to  arrive  at  correct  answers.  Wrong 
alternatives  are  typically  ones  that  refer  to  elements  of  the  passage  but  fail 
to  capture  the  central  theme  or  continue  the  logic. 

The  comprehension  required  by  this  subtest  depends  upon  one's  abilities  to 
abstract,  generalize,  synthesize,  recognize  concepts,  and  reason 
cons t rue t i ve 1 y .  The  relevance  of  this  subtest  clearly  lies  in  its 
relationship  to  the  informational  aspects  of  high-level  Air  Force  jobs. 

Da  a  Interpretation  (PI) 

The  DI  subtest  consists  of  data  presented  in  tables  or  graphs.  These  data 
are  frequently  expressed  as  units  (such  as  thousands  of  hours)  which  are 
identified  either  above  the  data  in  a  column  heading  or  below  the  data  in  a 
footnote.  bach  set  of  data  is  followed  by  several  questions  pertaining  to  it, 
and  each  question  has  five  alternatives.  The  answer  to  any  one  item 
(question)  is  independent  of  the  answers  to  the  other  items  that  refer  to  the 
same  data. 

All  of  the  Dl  questions  require  examination  and  interpretation  of  the 
relationships  within  the  data  displayed.  Answers  to  some  questions  depend  on 
making  the  correct  associations  between  two  or  more  sets  of  numeric 
information  given  in  different  areas  of  a  table.  Otuer  questions  require  a 
series  of  simple  arithmetic  calculations  or  estimates  such  as  the  computation 
of  ratios,  percentages,  proportions,  or  ranges  after  interpreting  the  given 
data.  Most  of  these  calculations  involve  whole  numbers,  and  the  alternative 
answers  rarely  have  more  than  one  decimal  place. 

Incorrect  alternatives  may  be  values  or  variables  obtained  by  locating 
and/or  comparing  incorrect  celLs,  arriving  at  intermediate  solutions, 
performing  incorrect  computations,  misunderstanding  the  meaning  ot  cells, 
and/or  failing  to  convert  cell  numbers  to  their  correct  scales.  The  subject 
matter  ot  the  charts  does  not  play  a  significant  role  in  the  examinee's 
ability  to  answer  the1  DI  questions. 

The  1)1  subtest  calls  upon  one's  abilities  to  locate,  interpret,  and 
integrate  different  types  ot  data  appearing  in  a  variety  of  tabular  and 
graphic  structures,  in  order  to  satisfy  problem  requirements.  Recognizing 
relationships,  as  well  as  basic  arithmetic  reasoning,  are  other  abilities 
involved  in  this  test. 
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Word  Know  ledge  (WK) 


The  WK  subtest  measures  vocabulary  acquisition,  a  straightforward  aspect 
of  verbal  ability.  The  items  are  cast  as  synonyms.  A  word  is  given  and  then 
followed  by  five  alternatives;  one  of  these  alternatives  has  the  same  meaning 
as  the  given  word.  Incorrect  alternatives  include  words  that  may  sound  or 
look  like  the  given  word,  have  a  synonym  that  sounds  or  looks  like  the  given 
word,  are  opposite  in  meaning,  or  are  a  commonly  mistaken  meaning  for  the 
word.  No  technical  or  scientific  terms,  foreign  expressions,  or  proper  names 
are  included.  All  words  are  in  the  singular  form  unless  they  have  acquired  a 
use  more  common  in  plural  form  (e.g.,  data,  cross-purposes).  The  vocabulary 
is  at  the  high  school  completion  level. 

Math  Knowledge  (MK) 

The  MK  subtest  presents  items  requiring  knowledge  of  mathematical 
relationships.  The  item  stems  are  either  word  or  number  problems.  The  item 
types  include  symbolic  fractions,  decimals,  and  square  root  solutions. 
Understanding  of  mathematical  terms  such  as  "reciprocal"  or  "prime  number"  is 
assumed  by  some  items.  Incorrect  answers  to  the  problems  may  reflect 
solutions  arrived  at  because  of  common  misunderstandings,  lack  of  knowledge, 
or  inability  to  apply  learned  mathematical  relationships  accurately. 

Items  are  worded  such  that  reading  ability  does  not  play  a  significant 
role  and  none  of  the  calculations  involved  is  time-consuming,  complex,  or 
difficult.  MK  measures  mathematical  reasoning  and  mathematical  knowledge 
typical  at  a  first-year  college  level. 

Mechanical  Comprehension  (MC) 

The  MC  subtest  requires  acquaintance  with  and  understanding  of  basic 
physical  principles  such  as  friction,  centrifugal  force,  and  pressure,  and  the 
ways  in  which  these  principles  apply  to  the  operation  of  mechanical  devices. 
Diagrams  of  mechanical  situations  are  associated  with  75%  of  the  items;  each 
diagram  is  followed  by  several  items.  These  items  measure  such  things  as  the 
understanding  of  transfer  of  rotational  motion  and  rotational  range  of 
action.  The  remaining  items,  unillustrated,  require  knowledge  of  the 
mechanical  parts  of  engines  (particularly  automobile  engines)  and  knowledge  of 
hardware  and  tools.  Both  d iagr amma t ic  and  un illustrated  questions  deal  with 
functions  of  parts,  the  effect  of  movement  of  one  or  more  parts  on  other 
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Electrical  Maze  (EM) 


EM  items  ask  the  examinee  to  choose  a  correct  path  through  a  maze  from 
among  five  choices.  In  each  item,  five  boxes  of  equal  size  are  lined  up  at 
the  bottom  of  a  maze.  Within  each  box  are  two  dots  marked  S  (starting  point) 
and  F  (finishing  point).  A  line  leads  from  each  of  the  dots  marked  S.  Only 
one  box  will  have  a  line  from  the  S  point,  which  leads  to  a  single  circle  at 
the  top  of  the  maze  and  back  to  the  F  point  in  the  same  box.  Black  dots  which 
appear  where  two  lines  meet  or  intersect  show  the  only  places  where  turns  or 
directional  changes  may  be  made.  If  lines  meet  or  cross  where  there  is  no 
dot,  turns  or  directional  changes  may  not  be  made.  There  are  no  continuous 
loops  through  the  circle.  Lines  are  wrong  if  they  lead  to  a  dead  end,  if  they 
come  back  to  the  box  without  going  through  the  circle,  or  if  they  lead  to 
other  boxes.  Easy  items  tend  to  have  fewer  lines  and  dots  than  the  more 
difficult  i terns . 

Solvers  of  EM  problems  tend  to  take  a  tr ia 1-and-e rror  approach.  The 
examinees  with  the  highest  scores  may  develop  strategies  to  eliminate  wrong 
lines  quickly.  Spatial  aptitude  in  the  sense  of  being  able  to  ignore 
confusing  visual  cues  on  the  way  to  a  goal  is  what  is  being  measured.  With 
respect  to  aviation,  an  obvious  application  of  this  ability  is  in  tasks 
involving  electrical  wiring,  but  this  ability  may  also  be  related  to  piloting 
tasKs  involving  visualLy  seeking  locational  cues. 

Scale  Reading  (SR) 


The  SR  subtest  necessitates  reading  gauges,  dials,  and  meters.  Drawings 
of  these  devices  are  associated  with  either  one,  two,  or  three  items.  Each 
device  contains  one  or  two  scales  which  may  be  straight  or  curved,  and  which 
are  divided  and  subdivided  by  short  vertical  or  horizontal  lines.  The  width 
of  subdivisions  may  be  of  equal  intervals  or  based  on  logarithmic  scales. 
Numeric  values  are  given  at  two  or  more  points  on  each  scale.  The  examinee's 
task  is  to  estimate  the  numeric  value  indicated  by  an  arrow  pointing  to  the 
outride  edge  of  the  scale  at  a  point  which  is  either  between  or  outside  the 
two  given  values.  The  arrow  is  accompanied  by  the  item  number  so  that  the 
examinee  can  identify  the  appropriate  set  of  five  alternative  choices  before 
selecting  the  alternative  that  is  closest  to  his  or  her  estimate. 

The  more  difficult  items  employ  straight-line  logarithmic  scales,  curved 
scales,  scales  reading  from  right  to  left,  or  combinations  of  these.  The  SR 
subtest  measures  vi sua 1 -pe rce p t ua 1  acuity,  ability  to  comprehend  different 
tvpes  of  scales  and  unit  relationships,  and  ability  to  count  in  different 
scale  units. 

Instrument  Comprehension  (IC) 


I  <J  items  require  the  examinee  to  read  airplane  instruments  to  determine 
the  position  of  an  airplane  in  flight.  No  previous  experience  with  reading 
airplane  instruments  is  needed.  Fu 1 L  explanations  and  sample  items  are  given 
at  the  beginning  of  the  test.  Each  item  depicts  two  aircraft  instruments. 

One  indicates  the  amount  of  climb  or  dive,  and  the  degree  of  bank  to  the  lelt 
or  right.  The  other  is  a  compass  that  shows  the  direction  in  which  the 
airplane  is  headed.  The  two  instruments  are  accompanied  by  four  drawings  of 
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aircraft  in  flight.  The  examinee  must  select  the  one  illustration  that 
corresponds  to  the  readings  on  the  instruments. 

IC  items  vary  in  difficulty  depending  on  the  extent  to  which  the  amount  of 
climb  or  dive,  degree  and  direction  of  bank,  and  compass  heading  are  varied. 
Easier  items,  for  example,  have  one  or  more  of  the  following  elements:  no 
banking,  no  climb  or  dive,  and  a  heading  in  the  direction  of  one  of  the  major 
compass  points.  The  IC  items  draw  upon  not  only  form  visualization  and  scale 
but  also  their  relationship,  a  more  complex  aptitude. 

Block  Counting  (BC) 

The  task  in  the  BC  subtest  is  to  determine  the  number  of  blocks  that  are 
touched  by  a  designated  block  in  an  illustrated  pile.  Each  pile  consists  of 
11  blocks.  At  least  one  full  edge  of  every  block  in  the  pile  is  visible.  The 
pile  is  not  a  perfect  cube;  the  twelfth  block  is  "missing"  in  order  to  provide 
visual  information  on  the  other  blocks.  The  location  and  size  of  each  side  of 
the  missing  block  (usually  the  top  level)  are  clear  to  the  viewer,  and  two  of 
its  six  surfaces  are  obvious  from  the  gap  in  the  cube. 

Each  block-pile  illustration  serves  five  items.  Five  blocks  in  each  pile 
are  assigned  numbers  corresponding  to  five  item  numbers.  The  examinee  must 
count  how  many  blocks  in  the  pile  touch  the  numbered  block  and  choose  the  one 
answer  of  five  alternatives  that  matches  the  count. 

The  ability  to  visualize  the  abutment  of  surfaces  to  designated  objects  is 
what  is  measured  by  this  test.  It  is  one  of  several  distinct  approaches  in 
the  AFOQT  used  to  assess  spatial  ability. 

Table  Reading  (TR) 

The  TR  subtest  is  a  speeded  test;  that  is,  it  contains  more  items  than  can 
usually  be  answered  correctly  in  the  allotted  time.  It  tests  one's  ability  to 
read  tables  quickly  and  accurately  to  locate  and  identify  the  intersection  of 
two  coordinates.  Examinees  are  asked  to  locate  a  number  in  the  cell  of  a 
table,  given  the  X  and  Y  coordinates  of  the  cell.  Examinees  select  their 
answer  from  one  of  five  alternatives. 

The  table  is  a  square  matrix  containing  many  cells.  The  column  and  row 
headings  are  the  same  and  range  from  a  negative  number  to  its  equivalent 
positive  number.  Numbers  inside  the  cells  increase  from  left  to  right  and 
from  top  to  bottom.  A  cell  number  may  be  identical  to  the  cell  number  in  the 
cell  above  or  to  the  left  of  it  or  may  be  several  units  larger.  Given 
particular  X  and  Y  coordinates,  the  incorrect  alternatives  can  be  the  cells 
found  by  applying  incorrect  combinations  of  signs  to  the  coordinates,  by  using 
numbers  in  cells  adjacent  to  the  correct  cell,  or  by  using  numbers  that 
contain  the  same  digits  as  those  in  the  correct  cell,  but  in  a  different 
order.  The  easier  items  are  those  in  which  the  correct  cells  are  on  or  near 
the  border  of  the  table  and  where  the  X  and  Y  coordinates  are  of  the  same  sign. 

Aviation  Information  (AI) 

AI  items  measure  knowledge  of  general  aeronautical  concepts  and 
terminology.  Examinees  must  choose  the  one  of  five  alternatives  that  either 
best  completes  a  sentence  or  best  answers  a  question.  Items  cover  topics 
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such  as  the  functions  of  certain  airplane  parts,  the  proper  operation  of 
airplanes,  hazards  encountered  by  airplanes  and  how  these  may  differ  from 
one  type  of  airplane  to  another,  navigation  and  communication  techniques, 
aviation  weather  maps,  and  aeronautical  terms  and  concepts.  AI  items  are 
relevant  to  civil  aeronautics  rather  than  military  aeronautics.  The 
emphasis  of  the  items  is  on  light  aircraft  or  general  aviation  aircraft  with 
turbo  jet  or  reciprocating  engines. 

Rotated  Blocks  (RB) 

The  RB  subtest  requires  the  mental  manipulation  of  objects  in 
three-dimensional  space  and  visualization  of  the  changed  appearance  of  those 
objects  as  they  are  rotated.  Each  item  presents  a  drawing  of  a  given 
three-dimensional  block  followed  by  five  drawings  depicting  blocks  in  varied 
positions.  One  of  the  five  has  the  exact  configuration  of  the  given  dock 
but  is  shown  from  a  different  angle.  The  examinee  must  determine  which  one 
of  the  five  has  this  configuration.  The  drawings  are  made  large  and  clear 
to  preclude  calling  upon  acuity  of  visual  discrimination. 

The  aptitude  measured  by  RB  is  spatial  visualization. 

General  Science  (GS) 

GS  items  measure  knowledge  of  basic  scientific  terms,  concepts, 
principles,  and  instruments.  Questions  are  followed  by  five  alternative 
choices.  The  content  is  drawn  from  astronomy,  biology,  chemistry, 
electronics,  geography,  meteorology,  and  physics. 


Hidden  Figures  (HF) 


HF  items  examine  an  individual's  ability  to  see  a  simple  figure  in  a 
complex  drawing.  Five  distinct  figures  lettered  A,  B,  C,  D,  and  E  appear  at 
the  top  of  each  page.  Below  these  are  several  drawings,  each  embedding  one 
of  the  five  distinct  figures  in  a  complex  setting.  The  examinee  must 
determine  which  one  of  the  five  figures  appears  in  each  drawing.  The 
correct  hidden  figure  in  each  complex  drawing  will  always  be  the  same  size 
and  in  the  same  position  (unrotated)  as  it  appears  among  the  figures  at  the 
top  of  the  page. 


HF  calls  upon  visual  perception,  spatial  imagery,  short-term  memory,  and 
the  ability  to  ignore  irrelevant  and  confusing  visual  cues. 


Composites 


The  subtests  that  make  up  the  composites  were  shown  in  Table  1.  Logical 
analysis  and  empirical  data,  where  possible,  have  determined  the  recommended 
uses  of  these  composites  (Miller,  1969b). 


Pilot  (P) 


The  Pilot  composite  was  formulated  to  predict  Undergraduate  Pilot 
Training  success.  The  criterion  measure  used  to  evaluate  the  predictiveness 
of  this  composite  was  flying  deficiency  leading  to  removal  from  training. 
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High  scorers  have  been  shown  to  have  the  degree  and  kinds  of  aptitudes 
necessary  to  complete  training  successfully. 

Navigator-Technical  ( N-T ) 

The  N-T  composite  is  relevant  for  success  in  Undergraduate  Navigator 
Training,  other  training  programs  which  stress  mechanical  and  engineering 
concepts,  and  in  pilot  training  as  well. 

Academic  Aptitude  (AA) 

The  AA  composite,  which  emphasizes  verbal  and  mathematical  reasoning 
skills,  was  designed  to  predict  success  in  any  training  program  that  is  high 
in  academic  content.  The  academic  portions  of  the  OTS  and  AFKOTC  program 
curricula  are  examples. 

Verbal  (V) 

The  Verbal  composite  was  formulated  to  predict  training  success  in 
programs  that  require  verbal  skills,  such  as  administrative  services,  public 
information,  and  education  and  training. 

Quantitative  (Q) 

The  Q  composite  predicts  success  in  mathematically  referenced  training 
programs  such  as  accounting,  auditing,  statistical  services,  disbursing,  and 
supply. 

II.  DEVELOPMENT  OF  FORM  P 

In  1983,  anticipation  of  the  need  for  future  forms  of  the  AFOQT  prompted 
initiation  of  a  project  to  develop  a  pool  of  experimental  items  from  which 
selections  would  be  made  tor  AFOQT  Form  P  in  two  parallel  versions.  The 
advantages  of  constructing  two  forms  concurrently  were  expected  to  include 
improved  retesting  capability  and  reduction  of  opportunities  for  test 
compromise . 

Approximately  4,800  items  were  created  for  the  experimental  pool.  The 
major  requirement  guiding  item  development  was  that  the  subtests  were  to 
have  a  high  resemblance  to  the  Form  0  subtests  in  content,  appearance,  and 
distributions  of  item  difficulty  and  item  discrimination.  The  continuity  of 
instruments  proven  to  predict  officer  performance  would  thus  be  assured. 

The  process  of  Form  P  construction  summarized  in  this  section  of  the  manual 
is  described  at  length  in  Berger,  Gupta,  Berger,  and  Skinner  (1988). 

Experimental  Item  Construction 

New  Items 

The  experimental  items  were  written  to  be  consistent  with  Form  0  in  terms 
of  format,  semantics,  and  punctuation;  content  categories  of  certain 
subtests;  item  difficulty  range;  and  specific  requirements  for  graphics. 

Item  writers  were  provided  with  operational  definitions  that  described  the 
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constraints  they  must  observe  as  to  the  content,  scope,  complexity,  length, 
appearance,  and  number  of  items  and  response  options  (see  Appendix  B,  Sample 
Items).  When  appropriate,  the  content  categories  covered  in  selected 
subtests  were  expanded  to  include  a  broader  concept  of  the  area  tested;  for 
example,  adding  computer-related  questions  to  the  General  Science  subtest 
because  of  their  importance  in  the  science  fields.  Form  0  item  data  from 
officer  applicant  examinees  were  analyzed  to  gain  insight  on  the  factors 
affecting  difficulty,  and  these  factors  were  used  to  guide  Form  P  item 
development  to  approximate  Form  0's  range  of  difficulty. 

Common  Items 

Included  in  the  research  of  the  experimental  pool  were  anchor  or  common 
items  selected  from  Form  0.  Seven  booklets  of  new  and  common  items  were 
assembled  for  each  subtest  except  Scale  Reading  and  Table  Reading  (speeded 
tests);  for  these  two  subtests,  14  booklets  were  prepared.  (To  obtain 
accurate  item  statistics  for  the  last  items  in  the  speeded  tests,  the  tests 
were  administered  in  either  forward  or  reverse  order  to  seven  samples 
each.  )  The  common  items  for  each  subtest  appeared  in  all  seven  (or  14) 
booklets  of  that  subtest,  in  order  to  verify  that  the  different  samples 
tested  on  the  seven  booklets  of  a  subtest  were  comparable  in  ranges  of 
ability.  Comparability  was  essential  to  maximize  the  constancy  of  scale 
indices  for  item  difficulty  and  item  discrimination  measures  across  examinee 
groups.  Based  on  the  collective  set  of  findings  described  below,  the 
samples  were  judged  to  be  sufficiently  comparable  to  proceed  with  a  Form  P 
test  construction  strategy  in  which  item  indices  were  treated  as 
non-sample-specific.  A  second  important  reason  for  including  common  items 
was  to  provide  continuity  with  previous  AFOQT  forms. 

Field  Tests 

Approximately  350  airmen  attending  Basic  Military  Training  at  Lackland 
AFB,  Texas,  were  tested  on  each  of  the  experimental  booklets  between  August 
1934  and  December  1986.  Basic  airmen  constituted  the  only  practicable  group 
on  which  to  obtain  preliminary  data  for  evaluating  item  adequacy.  Because 
the  AFOQT  is  administered  for  operational  selection  and  classification 
purposes  at  about  500  military  testing  sites  in  the  Continental  United 
States  and  overseas,  it  was  not  logistically  or  economically  feasible  to 
field  test  the  several  thousand  new  test  items  with  officer  applicants. 
Supplemental  data  on  two  subtests,  General  Science  and  Aviation  Information, 
were  obtained  by  readministering  the  experimental  test  booklets  to  cadets 
attending  OTS  between  October  1985  and  January  1986.  Biographic  information 
was  collected  from  all  examinees. 

Analysis  and  Results 

Sample  Comparability 

To  evaluate  the  comparability  of  the  samples  of  test-takers,  the  7  to  14 
basic  airmen  samples  tested  on  each  experimental  subtest  were  compared  for 
biographic  and  score  characteristics.  Tables  which  show  the  analysis  results 
are  presented  in  Berger  et  al.  (1988).  A  summary  of  the  major  findings 
follows.  The  samples  appear  to  have  been  relatively  comparable  on  most 
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variables.  Age  differences  across  samples  for  each  subtest  were  tyoically 
less  than  1  year;  the  mean  number  of  years  of  education  rarely  varied  by  more 
than  2  months;  and  the  range  of  differences  in  percentage  of  ethnic 

representation  was  rarely  more  than  10%.  However,  the  range  of  male/female 
representation  in  the  various  samples  did  vary  considerably  for  some  subtests. 

The  various  airmen  samples  were  also  compared  for  test  performance,  first 
on  the  Armed  Forces  Qualification  Test  (AFqT)  composite  score  derived  from  the 
Armed  Services  Vocational  Aptitude  Battery  (ASVAB) .  The  mean  percentile 
scores  were  reasonably  close;  differences  across  the  samples  within  subtests 
ranged  from  2.2  (BC)  to  6.2  (AR) .  The  modal  difference  was  approximately 
three  points.  Performance  on  the  common  items  included  in  the  experimental 
AFOQT  test  booklets  provided  a  second  method  for  assessing  the  comparability 
of  the  airmen  samples.  On  the  average,  the  proportion  of  airmen  samples 
answering  the  items  correctly  differed  by  .09  or  less. 

The  OTS  cadets  who  took  the  AI  and  GS  tests  were  older  than  the  majority  of 
airmen  examinees  (mean  ages  25  to  26  years  and  19  to  20  years,  respectively). 
All  cadets  had  completed  at  least  16  years  of  education,  whereas  95%  of  the 
airmen  had  completed  12  to  14  years.  The  gender  representation  was  relatively 
stable  across  samples  within  subtests,  with  the  percentage  of  males  varying 
between  85%  and  90%.  Ethnic  representation  among  cadets  showed  a  pattern 
quite  similar  to  that  of  airmen,  but  there  was  a  slightly  higher  proportion  of 
White  cadets.  On  the  common  items  in  the  experimental  test  booklets,  cadet 
samples  performed  more  consistently  than  did  the  airmen  samples.  Mean  scores 
for  the  cadets  differed  by  only  about  .02  points  across  the  seven  samples 
administered  both  the  Aviation  Information  and  General  Science  subtests. 

Test  Statistics 

In  view  of  the  finding  that  samples  administered  each  subtest  were 
comparable  on  biographic  and  ability  indicators,  the  experimental  items  were 
evaluated  by  classical  item  analysis  procedures  within  each  sample.  These 
analyses  were  used  to  identify  candidate  items  for  a  reduced  pool  from  which 
Form  P  selections  would  be  made. 

Item  Analyses.  The  results  of  analyzing  the  experimental  items  indicated 
that  the  number  of  new  items  that  met  discrimination  index  and  difficulty 
standards  was  sufficient  to  proceed  with  the  development  of  Form  P. 

Requirements  were  that  item-total  score  biseriai  correlations  be  negative  for 
all  non-keyed  (incorrect)  alternatives  and  positive  for  the  keyed  (correct) 
responses.  Further,  biserial  correlations  for  keyed  responses  had  to  reach  or 
exceed  a  minimal  discriminative  standard  of  .40.  An  additional  consideration 
was  the  need  for  items  to  have  difficulty  levels  that  fell  within  the  range  of 
and  approximated  the  distribution  of  difficulty  for  Form  0  items. 


Para  1  lc  1 


Lie  ve  topment 


The  criteria  for  selecting  both  common  and  new  items  for  Forms  PI  and  P2 
from  the  experimental  pool  included  equivalent  mean  difficulty  lev.’...  each 

subtest,  comparable  mean  biserial  correlations,  and  comparable  distributions 
of  content  categories  for  applicable  subtests.  Similar  distributions  of  keyed 
responses,  and  stylistic  features  such  as  item  formats,  were  also  factors  in 
item  selection. 
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Each  Form  P  subtest  was  constructed  to  have  50%  of  its  content  the  same  as 
that  of  the  corresponding  Form  0  subtest.  This  was  accomplished  by  the  use  of 
common  items  drawn  from  Form  0.  Selection  criteria  were  acceptable  difficulty 
and  discrimination  statistics,  balanced  subject-matter  content,  balanced  keyed 
response  options,  and  an  average  item  difficulty  comparable  to  that  of  the 
full  set  of  items  in  the  Form  0  subtest.  In  addition,  the  positions  of  the 
common  items  were  considered  in  the  selection  process  in  order  to  balance 
items  taken  from  the  beginning,  middle,  and  end  cf  each  subtest. 

The  considerations  for  selecting  common  items  also  governed  the  selection 
of  new  items  for  Form  P.  The  characteristics  of  Form  0  items  to  be  replaced 
were  particularly  considered  and  item-by-item  matches  were  attempted.  A  number 
of  trade-offs  were  necessary  to  balance  content,  keys,  appearance,  and 
position;  but  in  general,  priority  was  given  to  matching  the  item  difficulty 
and  discrimination  of  Form  PI  and  Form  P2  subtests. 

Table  2  shows  the  experimental  item  data  for  the  Form  P  selections.  In 
the  context  of  the  analysis  of  experimental  items  (i.e.,  based  on  basic  airmen 
and  OTS  cadet  data),  Forms  PI  and  P2  were  highly  similar  in  subtest 
difficulty,  with  no  differences  for  11  subtests  and  differences  of  .01  for 
five  subtests.  Item  discrimination  was  exactly  the  same  for  PI  and  P2  in  four 
ouoi.esLs ,  ohowed  a  difference  of  .01  for  six  subtests  and  differences  from 
.02  to  .04  for  six  subtests. 


Table  2.  Mean  Item  Difficulty  (j>)  and  Discrimination  Statistics  (£bis) 
for  Experimental  Items  Selected  for  Forms  PI  and  P2 


Subtests 

£ 

£bis 

PI 

P2 

PI 

P2 

Verbal  Analogies 

.61 

.60 

.47 

.47 

Arithmetic  Reasoning 

.53 

.53 

.56 

.56 

Reading  Comorehens ion 

.64 

•  64 

.54 

.55 

Data  Interpretation 

.52 

.53 

.45 

.48 

Word  Knowledge 

.56 

.56 

.55 

.57 

Math  Knowledge 

.58 

.58 

.55 

.54 

Mechanical  Comprehension 

.50 

.49 

.39 

.39 

Electrical  Maze 

.38 

.38 

.68 

.67 

Scale  Reading  (speeded) 

.55 

.55 

.42 

.38 

Instrument  Comprehension 

.45 

.44 

.59 

.62 

Block  Counting 

.51 

.51 

.68 

.65 

Table  Reading3  (speeded) 

- 

- 

.71 

.70 

Aviation  Information 

.43 

.43 

.59 

• 

Ui 

00 

Rotated  Blocks 

.51 

.51 

.54 

.54 

General  Science 

.44 

.44 

.43 

.40 

Hidden  Figures 

.63 

.63 

.67 

.68 

aItem  difficulty  is  not  relevant  for  Table  Reading.  All  items  are 
inherently  easy. 
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Comparison  of  the  data  for  the  new  forms  showed  that  high  comparability, 
if  not  parallelism,  was  achieved  between  Forms  0  and  P.  The  equivalence  of 
the  items  in  Forms  pi  ancj  p2  is  discussed  in  the  next  section,  which  presents 
evidence  regarding  their  psychometric  properties  when  administered  in  the 
actual  Form  P  context  to  applicants  for  officer  commissioning  training 
programs . 


III.  STATISTICAL  ANALYSIS 
Comparability  of  Forms  PI  and  P2 

The  data  in  this  section  were  obtained  from  applicants  for  officer 
commissioning  at  operational  test  sites,  including  AFROTC  detachments  and 
Military  Entrance  Processing  Stations,  between  15  June  1987  and  31  October 
1987.  AFOQT  Form  PI  was  administered  to  3,216  applicants,  and  Form  P2  to 
2,976  applicants,  using  an  equivalent  groups  design.  Equivalent  groups  were 
assured  by  instructing  test  administrators  to  distribute  Form  Pi  and  then  Form 
P2  to  alternate  examinees  in  each  testing  session.  The  last  form  distributed 
was  recorded,  and  the  next  testing  session  was  begun  with  the  distribution  of 
the  other  form.  Scores  were  analyzed  for  central  characteristics ,  item 
difficulty  and  discrimination,  internal  consistency  reliability,  and 
intercorrelation  of  the  composites. 

Score  Characteristics 


Common  Items 


Fifty  percent  of  the  items  in  each  Form  P  subtest  were  drawn  from 
Form  0.1-  The  score  means  and  standard  deviations  of  the  items  selected  from 
Form  0  to  appear  in  common  in  Forms  PI  and  P2  attest  to  the  comparability  of 
the  two  samples.  Table  3  shows  that  the  largest  mean  score  difference  for  any 
subtest  was  less  than  one-third  score  point  (Math  Knowledge),  and  that  the 
differences  in  standard  deviations  were  similarly  small.  A  similar  pattern 
was  found  when  subtest  scores  were  combined  into  composite  scores.  The 
average  performance  of  the  samples  tested  on  Form  PI  and  on  Form  P2  differed 
by  about  .5  point  or  less  on  four  composites.  The  largest  difference  was  1.14 
points  on  the  Navigator-Technical  composite. 

Total  Scores 


Each  AFOQT  subtest  is  scored  as  one  point  for  each  correct  answer.  (There 
is  no  correction  for  guessing.)  Table  4  shows  the  mean  scores  and  standard 
deviations  of  the  total  test  (new  and  common  items)  for  Form  P  examinees. 

Form  P2  has  mean  scores  that  are  higher  than  those  of  Form  PI  for  nine 

subtests,  with  differences  ranging  from  .01  point  higher  (Electrical  Maze  and 
Table  Reading)  to  approximately  1.41  points  higher  (Reading  Comprehension). 


lAfter  the  Form  PI  and  P2  booklets  were  printed  and  distributed  to 
operational  test  sites,  two  common  items  were  determined  to  be  inadequate  in 
construction.  These  items  —  one  in  the  General  Science  subtest  ana  one  in 
the  Aviation  Information  subtest  —  remain  in  the  test  booklets  but  are 
omitted  from  the  test  scoring  procedure. 
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Table  3.  Score  Means  and  Standard  Deviations 
of  Common  Items  on  AFOQT  Forms  pi  and  P2 


Subtest 

Number  of 
common  items 
scored 

Form 

Mean 

Pi 

- ST 

Form 

Mean 

P2 

SD 

Verbal  Analogies 

13 

8.18 

2.44 

8.35 

2.44 

Arithmetic  Reasoning 

12 

7.34 

2.81 

7.10 

2.83 

Reading  Comprehension 

13 

8.47 

2.93 

8.57 

3.06 

Data  Interpretation 

12 

7.64 

2.39 

7.67 

2.40 

Word  Knowledge 

13 

7.52 

3.12 

7.53 

3.13 

Math  Knowledge 

12 

7 . 64 

3.16 

7.34 

3.10 

Mechanical  Comprehension 

11 

5.62 

2.36 

5.60 

2.39 

Electrical  Maze 

1U 

3.99 

2.11 

3.92 

2.03 

Scale  Reading 

20 

11.64 

'<.44 

11.36 

3.45 

Instrument  Comprehension 

10 

5.52 

2.73 

5.55 

2.82 

Block  Counting 

10 

6.07 

2.33 

5.78 

2.15 

Table  Reading 

20 

13.95 

3 . 64 

13.92 

3.50 

Aviation  Information 

9 

4.22 

2.24 

4.28 

2.22 

Rotated  Blocks 

7 

3.67 

1.76 

3.75 

1.81 

General  Science 

9 

4.22 

1.97 

4.20 

1.97 

Hidden  Figures 

8 

5.30 

1.77 

5.33 

1.74 

Composite 

Pilot 

103 

59.19 

14.20 

58.75 

13.99 

Navigator-Technica 1 

131 

77.10 

18.63 

75.96 

18.26 

Academic  Aptitude 

75 

46.79 

12.89 

46.56 

13.01 

Verbal 

39 

24.17 

7.26 

24.46 

7.44 

Quantitat  Lve 

36 

22.62 

7.09 

22.11 

7.11 

Note.  Form  PI  N  =  3,216;  Form  P2  N  =  2,976. 


The  seven  subtests  in  Form  Pi  that  have  higher  mean  scores  than  those  of  their 
counterparts  in  Form  P2  show  differences  ranging  from  approximately  .09  point 
higher  for  Word  Knowledge  to  1.37  points  higher  for  Math  Knowledge.  Restated, 
these  data  indicate  that  Form  P2  is  slightly  easier  for  nine  subtests, 
particularly  Reading  Comprehension,  and  Form  Pi  is  easier  for  seven  subtests, 
particularly  Math  knowledge.  Because  of  the  large  sample  sizes,  even 
differences  as  small  as  .01  raw  score  point  are  statistically  significant. 

The  differences  between  means  of  Forms  Pi  and  P2  tor  Reading  Comprehension 
(1.41  score  points),  Math  Knowledge  (1.37  score  points),  and  Block  Counting 
(1.09  score  points)  may  be  meaningful. 

When  the  subtests  are  organized  into  composites,  Form  Pi  is  shown  to  be 
slightly  easier  for  the  Pilot,  Navigator-Technical,  and  Quantitative 
composites,  with  differences  between  means  ranging  from  .61  to  3.34  score 
points.  The  Academic  Aptitude  and  Verbal  composites  appear  to  be  sligntly 
easier  in  Form  P2,  witn  their  means  being  .91  and  2.01  points  higher, 
respectively,  than  those  of  Form  PI.  The  largest  difference  between  means  was 
seen  for  Navigator- Technical  (3.34  raw  score  points). 
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Table  4.  Means  and  Standard  Deviations  of  AFOQT  Forms  PI  and  P2 
Subtest  and  Composite  Raw  Scores 


Subtest 

Number  of  (Form 

items  scored  Mean 

PI 

SD 

Fo  rm 

Mean 

P2 

SD 

ta 

Verbal  Analogies 

25 

16.21 

4.48 

16.90 

4.40 

6.11 

Arithmetic  Reasoning 

25 

15.48 

5.70 

15.09 

5.55 

2.72 

Reading  Comprehension 

25 

15.17 

4.99 

lo .  58 

5.24 

10.82 

Data  Interpretation 

25 

16.00 

4.50 

16.67 

4.83 

5  .b4 

Word  Knowledge 

25 

15.16 

5.74 

15.07 

5.78 

0.61 

Math  Knowledge 

25 

16.63 

6.24 

15.26 

6.09 

8. 8i 

Mechanical  Comprehension 

20 

9.93 

4.18 

9.9b 

4.17 

0.28 

Electrical  Maze 

20 

7.76 

3.74 

7.77 

3.48 

0.15 

Scale  Reading 

40 

23.79 

6.86 

22.51 

6.59 

1 .  b4 

Instrument  Comprehension 

20 

11.52 

5.27 

11.98 

5.45 

3.37 

Block  Counting 

20 

12.79 

4.34 

11.70 

3.99 

10.30 

Table  Reading 

40 

27.59 

7.11 

27.60 

6.84 

0.06 

Aviation  Information 

19 

8.49 

4.19 

9.05 

4.22 

7.0b 

Rotated  Blocks 

15 

8.00 

3.21 

8.47 

3.40 

5.58 

General  Science 

19 

9.34 

3.94 

9.15 

3.67 

1.9b 

Hidden  Figures 

15 

9.99 

3.12 

9.78 

2.95 

2.72 

Composite 

Pilot 

204 

118.08 

28.35 

117.47 

27.74 

0.8b 

Navigator-Technical 

264 

157.29 

38.38 

153.95 

37.43 

3.48 

Academic  Aptitude 

150 

94.65 

25.43 

95.56 

26.05 

3.07 

Verbal 

75 

46.54 

13.51 

48.55 

13.94 

5.7a 

Quantitative 

75 

48.11 

14 . 57 

47.01 

14.82 

2.94 

Note.  Form  PI  N  =  3,216;  Form  P2  N  =  2,976. 

aDue  to  the  large  sample  sizes,  all  differences  between  means  for  these 
samples  are  significant  beyond  the  .01  level. 


The  standard  deviation  indicates  the  way  measures  are  dispersed  around  a 
mean.  One  standard  deviation  on  each  side  of  the  mean  accounts  for  tne  scores 
of  approximately  two-thirds  of  any  sample  with  normally  distributed  scores. 

For  example,  in  interpreting  Table  4,  two-thirds  of  the  Form  PI  sample  had 
Block  Counting  (BC)  scores  that  differed  from  the  mean  score  of  12.79  by  4.34 
points  (plus  and  minus);  in  other  words,  the  said  scores  ranged  from  8.45  to 
17.13.  Two-thirds  of  the  P2  sample  had  BC  scores  ranging  from  7.71  to  15.69. 
The  scores  achieved  by  the  two  samples  in  the  range  of  the  mean  plus  or  minus 
one  standard  deviation  are  quite  close,  8.68  points  for  Form  Pi  and  7.98 
points  for  Form  P2,  indicating  quite  similar  spreads  around  their  respective 
mean  scores. 

Knowledge  of  the  standard  deviation  of  test  scores  is  useful  for  a  variety 
of  applications.  One  of  these  is  to  compare  the  scores  of  samples  on 
alternate  test  forms  to  see  if  one  of  the  forms  has  a  more  restricted  score 
range  than  the  other.  If  this  were  the  case,  there  could  be  different 
validity  outcomes  computed  for  the  two  forms.  (Validity  will  be  discussed  in 
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a  Later  section.)  Forms  Pi  and  P2  do  appear  to  be  comparable,  however,  with 
respect  to  the  standard  deviations  of  both  the  subtests  and  composites. 

Skew  and  Kurtosis 


Skew.  When  a  distribution  of  scores  is  not  symmetrically  bell-snaped,  it 
indicates  that  a  larger  proportion  of  the  group  scored  either  on  the  nigh  end 
(negative  skew)  or  the  low  end  (positive  skew)  rather  than  in  the  middle. 
Appendix'  C  indicates  that,  for  the  most  part,  Forms  Pi  and  P2  show  highly 
similar  skewness.  The  differences  between  the  PI  and  P2  subtest  coefficients 
of  skewness  range  from  zero  (SR  and  AI )  to  .17  (VA) .  The  differences  between 
Pi  and  P2  composites  range  from  .01  (N-T  and  AA)  to  .08  (V  and  Q).  All  of  the 
composites  are  negatively  skewed,  indicating  that  higher  scores  were  more 
frequent  than  lower  ones. 

kurtosis.  kurtosis  refers  to  tne  peakedness  of  a  score  distribution. 

Given  the  same  number  of  cases,  a  tall  and  narrow  curve  is  self-defined  as  a 
small  range  of  scores;  a  short  and  wide  curve,  as  a  wide  range  of  scores. 
i)epartures  from  normal  kurtosis  did  not  appear  to  be  extreme.  For  the  most 
p  irt ,  coefficients  of  kurtosis  are  comparable  for  the  PI  and  P2  subtests,  with 
t.  he  differences  ranging  from  zero  (AI)  to  .33  (TR).  Differences  in  kurtosis 
composite  scores  range  from  .01  (N-T)  to  .11  (Q) . 

Item  Characteristics 


Ar.a  lyses  were  conducted  on  the  data  of  the  two  samples  referenced  above  to 
obtain  item  difficulty  and  item  discrimination  statistics.  Appendix  D  shows 
the  mean  dilticuLty  and  discrimination  values  »nd  tne  distributions  of  the 
item  indices  tor  each  subtest  in  Form  Pi  (Tables  D-l  through  D-3)  and  in  Form 
P2  (Tables  D-4  through  D-G). 

item  Difficulty.  The  average  item  dilticulty  does  not  vary  between  each 
Form  Pi  and  Form  P2  subtest  by  more  than  .Ob  score  point  (Tables  D-l  and  D-4), 
md  tiie  difference  is  typically  much  smaller.  Furtner,  the  distributions  of 
item  difficulty  tor  the  two  torms  are  similar  (Tables  D-2  and  D-3).  These 
results  compare  very  favorably  to  the  data  for  the*  Pi  ana  P2  items 
administered  in  the  context  of  the  experimental  pool  (see  Section  11  above). 
The  parallelism  of  the  torms  with  respect  to  item  dilticulty  appears  to  be 
closer  in  these  data  tor  the  administration  of  Form  P  to  officer  applicants. 

Item  Discrimination.  Test  item  analyses  use  the  biserial  correlation  to 
measure  the  relationship  between  answering  an  item  correctly  and  being  in  a 
top  group  of  some  type,  usually  a  defined  top  sector  of  total  scores.  This 
correlation  should  be  high  for  the  keyed  answer  to  a  multiple-choice  test 
question,  and  negative  for  alL  tiie  wrong  options.  The  samples  of  applicants 
for  officer  commissioning  provided  data  indicating  that  the  AFOQT  subtests  in 
Forms  Pi  and  P2  had  mean  item-total  test  biccri^l  correlations  that  more  than 
;..ot  these  criteria  (Tables  D-3  and  D-b).  Mean  biserials  (Tables  D-l  and  D-4) 
ranged  from  .31  to  .72  in  Form  PI  (Scale  Reading  and  Math  Knowledge)  and  iron 
.48  to  .74  in  P2  (Scale  Reading  and  Instrument  Comprehens ion ) .  These  data 
compare  favorably  with  those  collected  tor  the  same  test  items  from  basic 
airmen  and  OTS  cadets  in  the  experimental  field  tests.  The  differences 
between  the  mean  biseriais  of  the  two  forms  were  never  greater  than  .Ob  for 
any  subtest,  providing  further  evidence  of  the  parallelism  of  Forms  Pi  and  P2. 
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Reliability  of  Subtests  and  Composites 


Several  testing  concepts  pertaining  to  the  consistency  of  test 
measurements  are  associated  with  various  types  of  reliability.  One  of  these 
is  stability  of  a  measure  administered  more  than  once,  estimated  by 
correlating  the  scores  obtained  from  a  group  of  examinees  administered  the 
same  test  on  two  occasions.  Another  is  "internal  consistency  reliability," 
which,  in  the  case  of  the  AFOQT,  is  computed  by  segmenting  the  subtests  into 
items  and  computing  their  equivalence.  A  third  is  the  "standard  error  of 
measurement"  (SEM),  a  measure  of  the  precision  of  a  test  score.  The  SEM 
indicates  the  area  in  which  an  examinee’s  "true"  score  is  expected  to  fall, 
given  obtained  scores  on  repeated  test  administrations.  An  examinee's 
obtained  score  is  expected  to  lie  within  one  SEM  on  each  side  of  the  true 
score  about  two-thirds  (68%)  of  the  time.  The  smaller  the  SEM,  the  more 
confident  one  can  be  that  different  scores  of  two  examinees  on  the  same  test 
represent  true  differences  in  aptitude.  These  different  types  of  reliability 
in  relation  to  the  AFOQT  Form  P  are  discussed  below. 

Stability 


The  test-retest  reliability  is  assumed  for  Form  P  from  data  for  Form  0,  as 
the  two  forms  are  comparable.  Arth  (1986a)  conducted  a  study  of  applicants 
for  officer  training,  of  whom  2,246  were  retested  on  Form  0  between  October 
1981  and  December  1983.  The  retest  group  differed  in  terms  of  the  intervals 
between  test  and  retest,  with  312  retesting  in  less  than  6  months,  1,300  in  6 
to  11  months,  443  in  12  to  17  months,  and  191  in  18  months  or  longer.  For  all 
retesters,  the  test-retest  correlations  on  the  composites  were  as  follows: 
Pilot  =  .812,  Navigator-Technical  =  .852,  Academic  Aptitude  =  .853,  Verbal  = 
.880,  and  Quantitative  =  .775.  Higher  or  lower  reliabilities  might  have  been 
obtained  had  the  estimates  been  computed  separately  for  groups  witn  different 
retest  intervals.  Allen  and  Yen  (1979)  noted  that  changes  in  lengths  of  time 
can  affect  test-retest  reliabilities  in  different  ways.  Other  factors  which 
may  have  influenced  the  magnitude  of  the  correlations  are  carry-over  effects 
such  as  recall,  practice,  coaching,  and  motivation.  Carry-over  effects  due  to 
motivation  are  probable  for  the  Arth  (1986a)  sample.  The  retesters  were  a 
self-selected  group  who  requested  an  opportunity  to  retake  Form  0,  presumably 
as  an  attempt  to  improve  their  scores  to  qualify  for  commissioning  or  aircrew 
training  programs.  Nevertheless,  the  reported  reliabilities  are  moderately 
high  and  suggest  stability  in  AFOQT  measurements  across  time. 

Internal  Consistency 

The  Kuder-Ri chardson  20  (KR-20)  formula  was  used  to  compute  the 
reliability  of  the  Form  PI  and  Form  P2  subtests  (Table  5),  and  a  formula 
developed  by  Wherry  and  Gaylord  (1943)  was  used  to  estimate  tne  reliability  of 
the  composites  (Table  6).  As  shown  in  Table  5,  the  reliabilities  of  the 
subtests  range  from  .75  to  .91  in  Form  PI  and  from  .71  to  .90  in  Form  P2. 

The  lower  reliabilities  were  for  Electrical  Maze  in  Form  PI  and  for  General 
Science  in  Form  P2,  and  both  high  reliabilities  were  for  Table  Reading.  (It 
shouLd  be  noted  that  KR-20  reliabilities  tend  to  be  overestimated  for  speeded 
subtests  and  for  the  composites  that  contain  them.)  Tne  comparability  of  the 
Forms  PL  and  P2  subtests  with  respect  to  reliability  is  striking.  Six  of  the 
subtests — VA,  AR,  WTK,  MC,  TR,  and  AI — have  KR-20  reliabilities  that  differ  by 
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onI>  .010  or  less  between  the  two  forms.  The  differences  for  RC,  MK,  EM,  SR, 
IC,  RB  and  HF  range  from  .015  to  .028.  The  largest  differences  are  themselves 

inconsequential  (.040,  .044,  and  .048  for  BC,  DI,  and  GS,  respectively). 

These  findings  indicate  close  comparability  between  Forms  Pi  and  P2. 


Table  5.  Internal  Consistency  Reliability  (KR-20) 
and  Standard  Error  of  Measurement  (SEM) 


KR-20 

SEM 

Subtest 

Pi 

TT~ 

Pi 

P2 

Verbal  Analogies 

.7  93 

.791 

2.035 

2.010 

Arithmetic  Reasoning 

.877 

.867 

2.001 

2.024 

Reading  Comprehension 

.826 

.854 

2.082 

2.005 

Data  Interpretation 

.772 

.816 

2.148 

2.071 

Word  Knowledge 

.876 

.874 

2.026 

2.051 

Math  Knowledge 

.902 

.886 

1.955 

2.057 

Mechanical  Comprehension 

.767 

.770 

2.020 

2.000 

Electrical  Maze 

.748 

.720 

1.878 

1.839 

Scale  Reading 

.851 

.831 

2.646 

2.705 

Instrument  Comprehension 

.876 

.891 

1.853 

1.798 

Block  Counting 

.835 

.795 

1.765 

1.806 

Table  Reading 

.908 

.900 

2.154 

2.166 

Aviation  Information 

.799 

.806 

1.878 

1.859 

Rotated  Blocks 

.751 

.774 

1.601 

1.619 

General  Science 

.7  62 

.714 

1.921 

1.962 

Hidden  Figures 

.766 

.740 

1.508 

1.501 

Table  6.  Composite  Reliabilities  and  Standard  Errors  of  Measurement 
TSEM)  Estimated  From  Subtest  KR-20  and  Communality  Values 


Reliability  estimates  Standard  error  of  measurement 


Composite 

KR- 

■20 

Communality 

KR- 

■20 

Communality 

PI 

P2 

PI 

P2 

PI 

P2 

PI 

P2 

Pi  lot 

.958 

.957 

.867 

.869 

5.81 

5.75 

10.34 

10.04 

Na v-Tech 

.971 

.969 

.914 

.915 

6 . 54 

6.59 

11.26 

10.91 

Academic  Apt 

.961 

.963 

.902 

.913 

5.02 

5.01 

7.96 

7.68 

Verbal 

.931 

.937 

.834 

.856 

3.50 

3.50 

5.50 

5.29 

Quant itative 

.941 

.943 

.845 

.860 

3.54 

3.54 

5.73 

5.55 

In  the  absence  of  a  parallel  forms  reliability  measure  obtained  from 
retesting  the  same  examinees  on  both  Forms  PI  and  P2,  the  Wherry-Gaylord 
(W-G)  procedure  was  used  to  obtain  an  estimate  of  the  reliability  of  each 
composite.  The  formula  uses  reliabilities  for  the  subtests  in  a  given 


composite.  Subtest  reliabilities  were  computed  usino  two  methods.  First, 
the  KR-20  method  was  used  to  obtain  an  upper  bound  estimate  of  composite 
reliability .  TFfis  estimate  is  upper  bound  because  the  KR-20  values  are 
inflated  due  to  the  presence  of  both  speed  and  mixed  model  subtests  in  the 
AFOQT,  rather  than  power  subtests  exclusively.  Second,  communality 
estimates  were  computed.  The  communality  of  each  subtest  is  its  multiple 
correlation  (R)  with  all  other  subtests.  Communality  was  computed  by 
regressing  each  subtest  raw  score  against  the  raw  scores  for  the  remaining 
15  subtests.  The  communality  procedure  provides  a  lower  bound  W-G  estimate 
of  composite  reliability.  The  "true"  reliability  for  each  composite  would 
be  expected  to  fall  between  the  estimates  computed  by  the  two  methods. 

The  W-G  estimates  of  the  reliabilities  for  the  Form  P  composites  (see 
Table  6)  are  higher  than  those  for  their  component  subtests,  ranging  from 
.93  to  .97  for  Form  PI  and  from  .94  to  .97  for  Form  P2,  based  on  the  KR-20 
method.  Using  the  communality  method,  the  estimates  range  from  .83  to  .91 
in  Form  PI  and  from  .8b  to  .92  in  Form  P2.  The  Navigator- Technical  composite 
shows  the  highest  reliability  in  both  forms,  but  all  are  acceptably  high. 

The  estimated  composite  reliabilities  for  the  two  forms  are  either  identical 
(Pilot)  or  close  to  identical,  with  differences  of  .01  to  .02. 

Standard  Error  of  Measurement  (SEM) 


The  SEMs  are  highly  similar  for  the  Forms  Pi  and  P2  subtests  (Table  5). 
They  range  from  approximately  1.5  to  2.2  score  points  except  for  Scale 
Reading,  which  has  standard  errors  of  2.6  and  2.7  in  PI  and  P2,  respectively. 
In  practice,  the  SEM  columns  of  Table  5  would  be  used  to  estimate  tde  range 
of  an  examinee's  "true  score"  on  a  particular  subtest;  that  is,  the  obtained 
score  minus  and  plus  the  SEM.  Table  6  displays  the  SEMs  for  the 
composites.  These  are  particularly  important  in  that  it  is  the  composite 
scores  which  influence  selection  and  placement.  Form  PI  SEMs  tend  to  be 
higher  than  those  of  Form  P2,  but  the  differences  are  quite  small,  varying 
from  .01  to  .06  score  point  (KR-20  method)  and  from  .18  to  .35  score  point 
(communality  method).  These  findings  add  further  support  to  the 
comparability  of  Forms  PI  and  P2. 

Intercorrelations 


Additional  evidence  is  seen  in  the  pattern  of  intercorrelations  between 
composites  for  the  two  forms  (Table  7).  The  correlations  between  any  pair 
of  composites  for  Form  PI  did  not  differ  from  those  for  Form  P2  by  more  than 
.01.  Equivalent  correlations  were  obtained  for  7  of  the  10  pairs.  Results 
for  subtest  intercorrelations  were  somewhat  more  variable  (Appendix  E), 
although  differences  between  the  correlations  for  subtest  pairs  on  Form  PI 
versus  Form  P2  rarely  exceeded  .05. 

Standardization 


Scoring  Scales  and  Norms 


To  enhance  the  interpretability  of  results  obtained  by  examinees, 
scoring  scales  and  norms  have  been  defined  for  the  AFO^T.  In  the  absence  of 
scales  and  norms,  raw  test  scores  are  not  meaningful,  in  part  because  they 
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Table  7.  Intercorrelations  of  Composite  Raw  Scores 
for  AFOQT  Forms  PI  and  P2 


Navigator- 

technical 

Academic 

aptitude 

Verbal 

Quantitative 

Pilot 

Form  Pi 

.93 

.75 

.61 

.74 

Form  P2 

.93 

.74 

.61 

.73 

Navigator- Technical 

Form  Pi 

.84 

.62 

.90 

Form  P2 

.84 

.62 

.89 

Academic  Aptitude 

Form  Pi 

.90 

.91 

Form  P2 

.90 

.91 

Verbal 

Form  Pi 

.64 

Form  P2 

.64 

are  affected  by  the  difficulty  of  items  on  different  test  forms.  On  early 
forms  of  the  AFOQT,  raw  scores  were  converted  to  a  stanine  scale  (Valentine 
&  Creager,  1961) .  However,  the  convention  for  the  past  three  decades  has 
been  to  derive  percentile  scales  for  the  composites.  Percentile  scores  on 
the  AFOQT  are  expressed  in  terms  of  the  percentage  of  examinees  in  the 
standardization  sample  who  fall  below  a  given  raw  score  point. 

The  normative  base  for  AFOQT  Forms  Pi  and  P2  is  a  sample  of  military 
personnel  tested  on  Form  N.  As  described  by  Gould  (1978)  in  a  report  on  the 
development  and  standardization  of  Form  N,  the  2,618-case  sample  consists  of 
basic  airmen  (25.4%),  AFROTC  cadets  (27.1%),  AFA  cadets  (27.1%),  OTS  cadets 
(10.2%),  and  active  duty  second  lieutenants  (10.2%).  The  sample  was 
designed  to  represent  the  full  range  of  ability  expected  in  the  officer 
applicant  population. 

Equating 


Form  P  scores  were  linked  to  the  normative  group  using  equipercentile 
equating  to  Form  0  scores.  Form  0  had  been  equated  to  Form  N  in  a  previous 
study  (Rogers,  Roach,  &  Wegner,  1986).  Analytic  techniques  were  designed  to 
achieve  comparability  to  the  Form  N  composite  scales. 

The  Form  P  equating  process  proceeded  in  two  steps.  Initially,  a  set  of 
provisional  raw-to-percentile  score  conversions  were  developed  on  a  research 
sample  of  random  and  equivalent  groups  of  Forms  0,  PI,  and  P2  examinees 
enrolled  in  three  military  training  programs:  enlisted  personnel  in  Basic 
Military  Training  and  officer  candidates  in  OTS  and  AFROTC  (Steuck,  Watson, 

&  Skinner,  1988).  The  provisional  conversions  were  used  during  the  Initial 
Operational  Test  and  Evaluation  (IOT&E)  of  Form  P  conducted  with  officer 
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applicants  between  15  June  1987  and  31  October  1987.  Final  conversion 
tables  (see  Appendix  F)  were  produced,  based  on  analyses  of  the  applicants' 
data.  These  conversion  tables  were  implemented  in  August  1988,  when  Form  P 
officially  replaced  Form  0  as  the  operational  AFOQT. 

Validity 


There  are  several  approaches  to  the  estimation  of  test  validity. 
Predictive  validity,  the  ability  of  a  test  score  to  predict  future 
performance,  is  the  pertinent  measure  for  evaluating  the  AFOQT.  The  Pearson 
product-moment  correlation  is  generally  used  to  establish  a  test  score’s 
relationship  to  a  performance  measure.  The  higher  the  validity  coefficient, 
the  more  accurate  the  prediction,  given  a  representative  sample  of 
sufficient  size  to  ensure  reliable  and  stable  results.  Because  score  data 
are  obtained  for  samples  of  a  population,  an  obtained  validity  coefficient 
may  be  the  result  of  chance  factors  such  as  sampling  error  which  affect  the 
composition  c:  one  particular  sample,  and  therefore  may  not  be  a  true 
indicator  of  the  strength  of  the  relationship  between  a  test  score  and  a 
performance  measure.  The  probability  (p)  of  obtaining  a  positive  or 
negative  correlation,  when  the  actual  correlation  is  zero,  can  be  computed. 
If  that  probability  is  .05  or  less,  the  result  is  considered  to  be 
statistically  significant. 

The  predictive  validity  design  requires  a  criterion  of  performance, 
whether  in  training  or  on  the  job,  to  relate  to  test  scores.  Various 
criterion  groups  are  referenced  in  past  AFOQT  Form  0  validity  studies.  At 
this  time,  no  research  on  the  validity  of  Form  P  has  been  completed.  The 
comparability  demonstrated  between  Form  P  and  Form  0  permits  generalization 
of  Form  0's  validity  studies  to  Form  P  (Steuck  et  al.  ,  1988). 

Selection  for  Commissioning  Programs 


Validity  research  has  shown  the  AFOQT  Form  0  to  be  significantly 
predictive  of  training  in  the  Officer  Training  School  (OTS)  and  the  Air 
Force  Reserve  Officer  Training  Corps  (AFROTC)  (Cowan,  Barrett,  &  Wegner, 
1989,  1990). 

OTS.  Passing  scores  on  the  AFOQT  and  a  Bachelor's  degree  from  an 
accredited  university  or  college  are  required  for  application  to  OTS. 
Research  was  conducted  to  validate  the  OTS  process  of  selecting  college 
graduates  for  the  12-week  training  program  (Cowan  et  al.,  1990).  The 
predictors  studied  included  the  percentile  scores  on  the  five  AFOQT 
composites.  Performance  criteria  were  measures  of  success  in  OTS: 

(a)  a  dichotomy  reflecting  graduation  or  elimination  from  training, 

(b)  final  grade  on  academic  material,  (c)  a  training  effectiveness  rating 
from  instructors,  and  (d)  an  indicator  of  whether  or  not  the  cadet  was  a 
distinguished  graduate.  Sample  sizes  varied  by  criterion  (n  =  3,200  to 

rv  =  4,500  cadets)  . 

Most  AFOQT  composites  were  found  to  be  significantly  predictive  of  all 
criteria,  with  the  highest  correlations  obtained  between  the  Academic 
Aptitude  and  Verbal  composites  and  final  grade  in  training  (r  =  .38  ana 
r  =  .43,  respectively).  Validities  for  other  criteria  were  Tower,  with  most 
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ranging  between  .10  and  .20  and  significant  at  £  <  .01.  The  lower 
correlations  were  not  unexpected  due  to  the  restricted  variance  of  both  the 
dichotomous  criteria  and  the  rating  scale  used  by  instructors.  For  example, 
the  vast  majority  of  cadets  graduate  from  training  and  few  qualify  for 
distinguished  graduate  status.  Another  factor  impacting  the  magnitude  of 
the  validities  was  restriction  in  range  from  use  of  the  AFOQT  to  determine 
qualifications  for  the  officer  force.  OTS  cadets  are  a  highly  select 
subgroup  of  applicants  for  commissioning  training.  Reported  correlations 
were  not  corrected  for  range  curtailment  on  the  test.  Overall,  the  finding 
that  test  scores  consistently  related  to  training  performance  attest  to  the 
validity  of  the  AFOQT  as  an  OTS  selector. 

AFROTC.  The  AFROTC  administers  a  2-year  Professional  Officer  Course 
(POCl  to  prepare  accepted  candidates  for  officer  duty.  POC  candidates  are 
evaluated  by  a  Quality  Index  Score  (QIS),  which  consists  of  six 
differentially  weighted  variables.  These  are  the  Verbal,  Quantitative,  and 
Academic  Aptitude  composites  of  the  AFOQT;  the  Scholastic  Aptitude  Test 
(SAT)  score;  cumulative  grade  point  average  (GPA);  and  a  Detachment 
Commander’s  rating,  which  is  an  overall  rating  based  on  the  first  five 
variables  and  a  personal  interview.  Research  was  conducted  to  validate 
these  predictors  against  training  and  post-commissioning  performance 
variables  (Cowan  et  al.,  1989). 

Validity  analyses  for  the  AFOQT  composites  were  reported  for  samples  of 
1,000  to  9,500  AFROTC  cadets.  The  criteria  were  completion/non-completion 
of  the  POC;  an  instructor  rating  of  training  performance;  distinguished 
graduate  status;  and  three  experimental  supervisory  ratings  of  job 
performance,  potential  for  career  progression,  and  job  motivation. 

Study  results  showed  that  every  criterion  except  job  motivation  was 
significantly  predicted  by  one  or  more  of  the  AFOQT  composites.  The 
Academic  Aptitude  composite  offered  the  greatest  predictability  of  the  three 
AFOQT  composites,  with  correlations  near  .15  for  instructor  rating, 
distinguished  graduate,  and  potential  for  career  progression  criteria.  The 
correlations  obtained  for  AFROTC,  like  those  observed  for  OTS  performance, 
are  likely  to  be  substantial  underestimates  of  the  predictive  power  of  the 
AFOQT.  Approximately  25%  of  the  QIS  used  for  AFROTC  cadet  selection 
decisions  was  due  directly  to  AFOQT  scores.  In  combination,  selection  on 
AFOQT  aptitudes  and  similar  ones  measured  by  the  SAT  produces  a  fairly 
homogeneous  group  of  cadets.  The  impact  of  aptitude  curtailment  was  not 
accounted  for  in  the  AFROTC  validities. 

Rated  and  Non-Rated  Training 

The  criterion-related  validity  of  the  AFOQT  as  a  selection  instrument 
for  Undergraduate  Navigator  Training  (UNT),  Undergraduate  Pilot  Training 
(UPT),  and  non-aircrew  technical  training  has  been  established  by  several 
studies,  summarized  below. 

Undergraduate  Navigator  Training  (UNT).  The  Navigator-Technical 
composite  is  one  of  the  primary  selectors  for  UNT.  A  study  by  Shanahan  and 
Kantor  (1986)  evaluated  all  the  AFOQT  composites  against  four  UNT  standards. 
The  criterion  measures  were  training  outcome  (graduation/elimination), 
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average  classroom  lesson  score,  average  simulator  lesson  score,  ana  average 
flying  lesson  score.  All  of  the  AFOQT  composites  were  found  to  correlate 
significantly  with  the  graduation/elimination  performance  measure.  Ttie 
correlations  for  the  composites  with  that  measure,  uncorrected  for  range 
restriction  on  the  tests,  were  between  .10  and  .19,  all  significant  at 

_p  <  .01. 

UNT  classroom  lesson  and  UNT  simulator  lesson  grades  were  also  well 
predicted  by  the  composites,  especially  by  Quantitative  (uncorrected  r  =  .40 
and  .32,  respectively).  The  Pilot  and  Navigator- Technical  composite  scores 
were  the  most  predictive  for  UNT  flying  lesson  grades  (uncorrected  r  =  .21 
and  .24,  respectively). 

The  results  of  hierarchical  regression  analyses  provided  evidence  that 
when  Navigator- Technical  scores  were  combined  with  Quantitative  scores,  the 
prediction  of  the  various  performance  criteria  was  improved.  The  multiple 
correlations  (R)  thus  obtained  were  .20  for  graduation/elimination,  .43  for 
classroom  lessen  grades,  .34  for  simulator  lesson  grades,  and  .26  for  flying 
lesson  grades. 

A  study  of  the  validity  of  AFOQT  subtests  and  composites  for  632  UNT 
attendees  was  conducted  by  Arth,  Steuck,  Sorrentino,  and  Burke  (in 
preparation) .  The  AFOQT  subtests  with  the  highest  correlations  (corrected 
for  restriction  of  range)  with  a  UNT  pass/fail  measure  were  MK  (.21), 

AR  (.23),  BC  (.24),  and  SR  (.25).  The  Pilot,  Navigator- Technical,  and 
Quantitative  composites  had  corrected  correlations  with  UNT  outcome  of 
.25,  .27,  and  .23,  respectively. 

Undergraduate  Pilot  Training  (UPT).  Candidates  for  OTS  and  UPT  who  do 
not  have  a  private  pilot  license  must  successfully  complete  the  Flight 
Screening  Program  (FSP)  prior  to  entering  OTS.  The  validity  of  the  Pilot 
composite  in  predicting  UPT  outcome  for  719  attendees  of  FSP  and  UPT  between 
1982  and  1986  was  investigated  by  Sawin  (1987).  The  AFOQT  Form  0  Pilot 
composite  percentile  score  had  a  statistically  significant  correlation  with 
UPT  outcome  (uncorrected  r  =  .15,  p  <  .001). 

Arth  et  al.  (in  preparation)  studied  the  validity  of  the  various  AFOQT 
subtests  and  composites  in  predicting  UPT  outcome  for  695  officers.  The 
subtests  that  best  predicted  the  UPT  pass/fail  outcome  were  Instrument 
Comprehension  (corrected  r  =  .38)  and  Aviation  Information  (corrected 
r  =  .30),  both  of  which  aTso  have  face  validity  for  pilot  tasks.  The  Pilot 
and  Navigator- Technical  scores  were  the  best  predictors  among  the  composites 
of  UPT  outcome,  with  corrected  r's  of  .30  and  .24,  respectively. 

Non-Rated  Technical  Training.  AFOQT  data  were  obtained  for  9,029 
non-aircrew  officers  who  attended  29  entry-level  and  8  advanced  non-rated 
technical  training  courses  (TTC)  between  October  1979  and  December  1983 
(Arth,  1986b).  The  AFOQT  composites,  categorized  as  either  Rated  (Pilot  and 
Navigator- Technical)  or  Non-Rated  (Academic  Aptitude,  Verbal,  and 
Quantitative)  were  all  correlated  with  final  TTC  grades,  although  only  the 
non-rated  composites  are  used  for  selection  in  this  context.  Most  of  the 
course  grades  were  well  predicted  by  all  composites,  entry-level  courses 
particularly.  The  correlations  between  composites  and  grades  ranged  from 
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.01  to  .62,  with  the  most  frequent  intervals  being  .31  to  .40  (59 
correlations)  and  .41  to  .60  (57  correlations).  The  Academic  Aptitude 
composite,  with  28  correlations  of  .31  to  .55,  predicted  course  grades  best, 
on  the  whole.  The  Navigator-Technical  composite  was  the  next  best  predictor, 
with  24  correlations  that  ranged  from  .31  to  .62.  The  correlations  reported 
were  uncorrected  for  the  range  restriction  in  test  score  variance  resulting 
from  preselection  of  the  officer  samples  on  AFOQT  measures. 

Improvements  in  prediction  were  achieved  for  the  non-rated  composites  by 
combining  them.  For  20  of  the  37  courses,  a  linear  combination  of  Verbal 
and  Quantitative  scores  predicted  final  grades  significantly  better  than  did 
the  single  composites.  Another  noteworthy  result  was  that  in  tne  cases  of 
Air  Traffic  Control  and  one  Intelligence  class,  the  Pilot  and 
Navigator- Technical  scores  predicted  grades  better  than  did  the  non-rated 
composites,  singly  or  in  combination. 

Air  Weapons  Controllers.  The  development  of  a  selection  strategy  to 
improve  the  success  rate  of  Air  Weapons  Controllers  during  training  and  in 
field  assignments  entailed  obtaining  predictive  validity  statistics  for  968 
students  at  five  organizations  responsible  for  training  Air  Weapons 
Controllers  (Finegold  &  Rogers,  1985).  Pearson  product-moment  correlations 
were  computed  between  the  AFOQT  composites  and  criteria  consisting  of 
academic  grade,  success  (i.e.,  pass  or  fail),  and  student  class  rank.  All 
of  the  AFOQT  composites  were  significantly  related  to  all  of  the  performance 
criteria  at  the  p  <  .01  level.  Academic  Aptitude  was  the  most  consistently 
successful  compositfe  in  predicting  student  performance,  with  its 
correlations  ranging  from  .26  to  .38.  The  Navigator-Technical  composite  was 
the  next  most  successful  predictor,  with  correlations  from  .23  to  .39.  The 
performance  criterion  best  predicted  by  all  composites  was  class  ranking. 

Validity  Generalization  for  Non-Rated  Officers.  As  a  preliminary 
attempt  to  assess  the  comparability  of  AFOQT  validity  across  test  forms, 
Hartke  and  Short  (1988)  conducted  a  Schmidt-Uunter  meta-analysis  on  47 
different  validity  coefficients  obtained  for  AFOQT  Academic  Aptitude 
composite  scores  against  technical  training  grades.  The  Schmidt-Hunter 
procedure  calculates  a  weighted  mean  validity  coefficient  corrected  for 
study  differences  in  range  restriction  and  predictor/criterion  reliability, 
and  variance  due  to  sampling  error,  among  other  differential  effects.  The 
weighted  mean  validity  coefficient  is  assumed  to  be  the  best  estimate  of 
true  validity  if  the  variance  in  correlation  coefficients  attributable  to 
sampling  error  and  other  study  effects  accounts  for  most  (>75%)  of  the 
observed  variance  in  the  set  of  effects. 

The  AA  composite  score  data  were  collected  from  studies  in  which  either 
several  AFOQT  forms  were  involved  (L,  M,  N,  and  0)  or  there  was  no 
information  as  to  which  form  was  used.  Therefore,  in  this  preliminary 
study,  the  question  of  validity  generalizability  across  forms  could  not  be 
answered.  Predictor  and  criterion  reliability  data  and  score  variance  were 
also  not  available,  precluding  corrections  for  these  factors.  Nevertheless, 
the  meta-analysis  could  be  conducted  to  determine  the  generalizability  of  AA 
across  all  jobs  in  the  sample,  and  subgroups  of  jobs  for  major  occupational 
groupings . 
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The  results  of  this  study  indicated  that  true  validity  was  not  the  same 
across  all  job  titles,  because  although  a  weighted  mean  correlation  of  .d9 
was  obtained,  sampling  error  accounted  for  only  33%  of  the  variance.  The 
wide  range  of  job  occupations  in  the  sample  probably  accounted  for  this 
result  to  a  large  extent.  For  the  four  major  occupational  subgroups 
studied,  observable  sampling  error  variance  was  higher,  but  exceeded  7j/«  for 
only  one  subgroup  studied,  Intelligence  and  Security  Police.  The 
meta-analysis  for  this  subgroup  produced  a  weighted  mean  correlation  of  .44, 
with  100  percent  of  the  observed  variance  attributable  to  sampling  error. 

At  this  time,  then,  the  AA  composite's  validity  can  be  considered  to  be  the 
same  for  all  occupations  within  only  the  Intelligence  and  Security  Police 
subgroup.  The  results  of  this  study  also  indicate  that  although  AA  validity 
varies  across  Air  Force  specialties,  the  individual  true  validities  will 
vary  around  the  mean  weighted  validity  coefficient  of  .39.  Planned 
meta-analysis  research  on  the  AFOQT  will  include  all  the  composites,  larger 
data  sets,  more  complete  information  on  test  forms  and  test  statistics,  and 
more  homogeneous  job  title  groups. 

Construct  Validity 


In  psychological  testing,  a  "construct"  is  a  meaningful  attribute  (e.g., 
anxiety;  verbal  ability)  that  is  hypothesized  to  operate  dimensionally  in 
human  behavior.  Thus,  there  are  supposedly  differing  levels  of  "anxiety" 
and  "verbal  ability."  Construct  validity  can  support  the  existence  or  such 
theoretical  dimensions  through  several  techniques  that  demonstrate  how  tests 
hypothesized  to  measure  the  same  construct  will  group  together.  Whereas  the 
predictive  validity  of  a  personnel  selection  test  such  as  the  AFOQT  provides 
the  most  '..''impelling  evidence  to  support  mterences  regarding  tue  test's 
utility,  construct  validity  can  confirm  the  nature  of  the  attribute  being 
measured.  A  variety  of  statistical  evidence  supports  the  constructs  of  tue 
AFOQT  subtests  and  composites.  These  include  factorial  validity  of  the 
subtests,  intercorrelations  of  relatively  homogeneous  subtests,  and 
correlations  with  non-military  tests  and  measures. 

Factorial  Validity.  A  factor  analysis  of  AFOQT  Form  0  was  conducted  for 
a  random  sample  of  3,000  examinees  to  determine  the  dimensions  underlying 
the  16  subtests  (Skinner  &  Ree ,  1987).  Five  factors  identified  in  the 
results  strongly  suggested  the  factor  names  of  Verbal,  Quantitative,  Space 
Perception,  Aircrew  Interest/Aptitude,  and  Perceptual  Speed. 

Verbal  Analogies,  Reading  Comprehension,  and  Word  Knowledge  were  the 
tests  that  clearly  defined  the  Verbal  Factor  (i.e.,  hau  the  highest  loadings 
on  Factor  I).  Math  Knowledge,  Arithmetic  Reasoning,  and  Data  Interpretation 
clearly  identified  the  Quantitative  factor.  Factor  III,  Space  Perception, 
was  composed  of  Rotated  Blocks,  Block  Counting,  Electrical  Maze,  Hidden 
Figures,  and  Mechanical  Comprehension.  Factor  IV,  Aircrew  Interest/Aptitude 
was  best  defined  by  Aviation  Information  and  Instrument  Comprehension.  The 
Perceptual  Speed  factor  included  the  speeded  subtests,  Table  Reading  and 
Scale  Reading.  The  AFOQT  test  constructs  identified  by  tnese  tactor  names 
appear  to  have  been  well  validated  by  the  types  of  tests  thaL  lormed  each 
f  actor. 
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Subtest  Intercorrelations.  Subtest  intercorrelations  for  AFOQT  Form  P, 
obtained  from  officer  applicant  samples  of  3,21b  for  Form  Pi  and  2,97b  for 
Form  P2,  varied  from  .16  to  .76.  With  samples  this  large,  even  the  smaller 
correlations  are  significant  and  indicate  that  all  the  subtests  are 
measuring  something  in  common.  The  commonalities  might  be  hue  to  general 
ability  and  the  basic  verbal  ability  required  to  read  test  instructions, 
together  with  the  ability  to  follow  those  instructions  in  detail.  What  is 
of  interest  with  respect  to  construct  validity  is  whether  tests  measuring 
similar  attributes  have  high  correlations  with  each  other,  but  low 
correlations  with  measures  of  hypothetically  dissimilar  aptitudes.  The 
subtest  intercorrelation  matrices  for  Forms  PI  and  P2  in  Appendix  E  provide 
evidence  of  such  validity  for  the  constructs  suggested  by  the  AFOQT  factor 
analysis. 


The  verbal  aptitude  construct  in  the  AFOQT  appears  directly  or  by 
implication  in  the  subtest  names  of  Verbal  Analogies  (VA) ,  Word  Knowledge 
( WK ) ,  and  Reading  Comprehension  (RC).  The  highest  intercorrelations 
obtained  for  these  three  subtests  are  with  each  other.  The  lowest 
correlations  for  these  verbal  subtests  are  witn  Electrical  Maze  (EM),  a 
subtest  that  one  can  intuitively  perceive  as  requiring  little  verbal  power 
for  its  accurate  responses.  The  verbal  construct  seems  to  be  well 
established.  Relatively  high  correlations  are  often  observed  between  verbal 
subtests  and  math  or  scientific  types  of  subtests  on  multiple  aptitude 
batteries.  It  can  be  convincingly  argued  tnat  math  and  science  problems  ao 
requie  verbalization  to  some  degree;  hence,  the  moderately  high 
correlations  obtained  between  these  subtest  types  on  Form  P. 

The  quantitative  aptitude  construct  suggested  by  the  factor  analysis  was 
defined  by  the  Arithmetic  Reasoning  (AR),  Data  Interpretation  (DI)  and  Math 
Knowledge  (MK)  subtests.  These  subtests  tend  to  correlate  more  highly  with 
each  other  than  with  other  subtests.  The  lowest  correlations  of  AR,  DI,  and 
MK  are  with  Aviation  Information  (AI)  (.20  to  .34).  The  AI  test  is  a 
knowledge  test  that  does  not  require  mathematical  reasoning. 

The  construct  identified  as  aptitude  for  space  perception  relies  on  five 
subtests  for  its  definition:  Electrical  Maze  (EM),  Block  Counting  (BC), 

Rotated  Blocks  (RB),  Hidden  Figures  (riF),  and  Mechanical  Comprehension 
(MC).  Appendix  E  indicates  tnat  these  subtests  in  both  Forms  PI  and  P2 

correlate  highly  with  each  other  (<  .39),  and  with  the  Scale  Reading 
subtest,  which  also  measures  visuaT-perceptual  sKills.  Tne  lowest 
correlations  typically  found  for  the  tests  defining  the  space  perception 
construct  were  with  WK  and  AI,  subtests  that  clearly  do  not  require 
spatial-perceptual  skill. 

The  construct  of  aircrew  interest/aptitude  is  supported  by  the 
intercorrelations  of  MC,  IC,  AI,  and  CS .  These  range  from  .43  to  .63.  The 
content  of  these  subtests  appears  to  be  appropriate  to  the  interests  and 
aptitudes  involved  in  successful  aircrew  worK.  Certain  subtests  have  low 
correlations  with  AI  but  have  moderate  to  high  correlations  with  the  other 
subtests  supporting  this  construct  (ilF,  SR,  MK,  and  EM).  The  aircrew 
interest/aptitude  construct  appears  to  be  a  complex  one  that  does  not  exclude 
to  a  great  extent  the  other  aptitudes  measured  by  the  AFOQT. 
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Perceptual  speed  was  the  last  construct  suggested  by  the  factor 
analysis.  The  subtests  whose  descriptions  support  this  construct.  Table 
Reading  (TR)  and  Scale  Reading  (SR),  were  well  correlated  (r  =  .53  and  .55 
on  Form  PI  and  Form  P2 ,  respectively);  but  SR,  which  has  computational 
components,  was  more  highly  correlated  with  subtests  involved  in  tne 
quantitative  construct.  TR  also  correlated  well  with  BC ,  probably  in  part 
due  to  BC's  perceptual  aspect.  Also,  the  BC  test  has  been  shown  to  have  a 
speeded  component  (Skinner  &  Ree,  19b7).  The  subtests  with  the  lowest 
correlations  with  TR  and  SR  were  AI  and  WK,  neither  of  which  requires 
perceptual  speed  to  respond  to  its  items. 

For  the  most  part,  data  from  the  subtest  intercorrelations  show  validity 
for  the  AFOQT  constructs  in  that  subtests  requiring  similar  skills  tended  to 
be  more  highly  correlated  with  each  other  than  with  other  subtests,  and  in 
that  the  lowest  intercorrelations  tended  to  be  between  subtests  that  purport 
to  require  few,  if  any,  of  the  same  skills. 

Correlations  with  Civilian  Tests.  A  study  was  conducted  to  determine 
the  extent  to  which  the  AFOQT  composites  correlate  with  similar  measures  in 
the  Scholastic  Aptitude  Test  (SAT)  and  the  ACT  composite  of  the  American 
College  Testing  Program  (Diehl,  198b) .  The  subjects  were  AFROTC  cadets 
entering  the  Professional  Officer  Course  during  Fiscal  Year  1985  (FY8o). 
Analyses  were  conducted  for  1,907  cadets  tested  on  both  tne  AFOQT  and  SAT 
and  for  1,240  cadets  tested  on  both  the  AFOQT  and  ACT. 

The  highest  correlations  for  the  AFOQT  composites  were  found  between  the 
Academic  Aptitude  composite  and  similar  composites  derived  from  the  cadets' 
combined  performance  on  the  verbal  and  math  sections  of  the  civilian 
batteries  (Table  8).  The  AFOQT  Academic  Aptitude  composite  showed 
a  correlation  of  .80  with  the  ACT  Composite  and  with  the  SAT  Composite. 
Correlations  between  the  part  scores  were  only  slightly  lower.  The  AFOQT 
Verbal  and  Quantitative  composites  correlated  .77  and  .71,  respectively, 
with  their  SAT  counterparts  SAT  Verbal  and  SAT  Math.  Scores  on  the 
Navigator^- Technical  composite,  which  includes  the  subtests  of  the 
Quantitative  composite,  also  related  highly  to  performance  on  the  SAT  Math 
composite  (r  =  .61).  Some  correlations  were  substantially  lower,  as  might 
be  expectedT  For  example,  the  Pilot  and  Navigator- Technical  composites  did 
not  correlate  highly  with  the  SAT  and  ACT  Verbal  composites,  probably 
because  there  is  little  overlap  in  test  content. 


Table  8.  Correlations  of  AFOQT  Form  0  Composites 
'  with  Civilian  Tests 


AFOOT  composite 

ACT  composite 

SAT  composite 

SAT  verbal 

SAT  matn 

Pilot 

.416 

.400 

.237 

.449 

Navigator- Technical 

.566 

.547 

.326 

.614 

Academic  Aptitude 

.804 

.801 

.716 

.687 

Verbal 

.681 

.702 

.772 

.473 

Quantitative 

.683 

.660 

.429 

.710 
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The  high  cerrelaticr.c  between  the  AFOQT  composites  and  their 
content-parallel  composites  in  prominent  civilian  tests  provide  additional 
confirmation  of  the  construct  validity  of  the  AFOQT. 


IV.  ADMINISTRATION  AND  SCORING 

Administration 


Testing  Locations 


AFOQT  testing  locations  include  Military  Entrance  Processing  Stations 
(MEPS),  Air  Force  Reserve  Officer  Training  Corps  detachments  on  university 
campuses,  and  Consolidated  Base  Personnel  Offices  (CBPOs)  on  military 
installations.  The  annual  testing  load  is  approximately  35,000  to  40,000 
examinees. 

Administrators 


Test  administration  is  conducted  and  monitored  by  Personnel  Technicians 
(Air  Force  Specialty  Code  73270).  Qualifications  for  the  Test  Control 
Officers  (TCOs),  who  supervise  and  monitor  on-site  testing  programs,  include 
attainment  of  the  rank  of  Master  Sergeant  or  higher.  Test  Examiners  (TEs) 
responsible  for  actual  administration  must  be  at  least  Staff  Sergeant 
selectees.  Training  activities  for  TEs  include  familiarization  with  testing 
regulations  (AFR  35-8,  AJ.r  Force  Military  Personnel  Testing  System,  and  AFR 
30-17,  Safeguarding  CONTROLLED  ITEM  (Test  Material)  Information)  and  review 
of  the  manual  for  administration  (Department  of  the  Air  Force,  1987b).  The 
manual  specifies  requirements  for  the  testing  environment  and  preparation  of 
testing  materials.  The  local  TCO  supervises  practice  testing  sessions  with 
the  TE.  TCOs  at  Air  Force  major  commands,  as  well  as  the  Inspector  General, 
ensure  local  TCOs  comply  with  standards  for  testing  conditions  and 
administration  procedures  as  specified  in  AFR  35-8  and  the  manual  for 
administration. 

Standardized  Procedures 


Information  Pamphlet.  Prior  to  testing,  examinees  receive  an 
Information  Pamphlet  (Department  of  the  Air  Force,  1987a)  to  acquaint  them 
with  the  nature  of  the  AFOQT.  The  pamphlet  describes  the  purpose,  content, 
and  schedule  of  the  test  and  presents  five  sample  items  with  answers  for 
each  subtest. 

Test  Environment.  A  space  of  15  square  feet  (inclusive  of  aisles)  per 
examinee  is  required.  Desks  are  arranged  for  efficient  test  distribution 

and  collection.  Minimum  standards  of  lighting  and  ventilation  must  be  met, 
and  all  examinees  must  be  visible  to  the  TE.  If  waivers  are  requested  for 
testing  facilities  that  are  below  standard,  the  request  must  include 
documentation  which  describes  the  actions  to  be  taken  to  counteract  the 
deficiences.  If  test  interruptions  such  as  power  failure  occur,  the  TE 
provides  instructions  to  the  examinees  with  respect  to  handling  of  test 
booklets  and  test  rescheduling.  In  such  cases,  tde  TE  collects  and 
safeguards  the  test  materials. 
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Directions.  Examiners  at  all  testing  sites  are  directed  to  deliver  the 
test  directions  exactly  as  stated  in  the  manual  for  administration.  Each 
testing  session  begins  with  the  TE's  verifying  each  examinee's  identity  and 
having  the  examinee  sign  a  test  roster.  The  TE  then  distributes  all 
necessary  testing  materials  (i.e.,  test  booklet,  answer  sheet,  and  pencil). 
The  test  purpose  and  rules  and  schedule  of  testing  are  explained.  Each 
examinee  is  requested  to  certify,  by  signing  the  answer  sheet,  that  he  or 
she  is  physically  fit  to  take  the  examination.  Otherwise,  the  examinee  is 
excused  and  rescheduled  for  a  later  testing  date.  Following  collection  of 
biographic  and  military  personnel  data,  a  Privacy  Act  Statement  is  read . 

This  statement  describes  the  authority  to  conduct  AFOQT  testing  (10  Unitea 
States  Code  508,  509,  and  510,  and  Executive  Order  9397),  the  principal 
purpose  for  collection  of  personal  identity  information  including  Social 
Security  Number,  and  routine  uses  of  the  scores,  rjcaminees  are  informed 
that  disclosure  of  personal  information  is  voluntary  but  tnat  refusal  could 
result  in  denial  of  a  military  commission.  Finally,  testing  commences  witn 
subtests  being  administered  according  to  the  schedule  shown  in  Table  9.  The 
total  amount  of  time  required  to  administer  the  AFOQT  is  approximately  4.5 
hours. 


Table  9.  Testing  Schedule 


Administration  time 
(in  minutes) 

Testing  time 
(in  minutes) 

Total  time 
(in  minutes) 

Pretest"  Activities 

24 

24 

Verbal  Analogies 

1 

8 

9 

Arithmetic  Reasoning 

1 

29 

30 

Reading  Comprehension 

1 

18 

19 

Data  Interpretation 

1 

24 

23 

Word  Knowledge 

1 

5 

b 

Math  Knowledge 

1 

22 

23 

Break 

10 

10 

Mechanical  Comprehension 

1 

22 

23 

Electrical  Maze 

3 

10 

13 

Scale  Reading 

3 

15 

18 

Instrument  Comprehension 

3 

6 

9 

Block  Counting 

2 

3 

3 

Table  Reading 

2 

7 

9 

Aviation  Information 

1 

8 

9 

Rotated  Blocks 

2 

13 

15 

General  Science 

1 

10 

11 

Hidden  Figures 

2 

8 

10 

Collection  of  Materials 

2 

V 

TOTAL  TIME  REQUIRED 

1  hr  2  rain 

3  nrs  28  min 

4  nrs  30  min 
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Test  Security 


The  Office  of  Primary  Responsibility  (OPR)  for  test  administration, 
scoring,  and  reporting  of  results  for  the  AFOOT  is  the  Personnel  Testing 
Branch,  Personnel  Measurement  Division,  Headquarters  Air  Force  Military 
Personnel  Center  (HQ  AFMPC/ DPMYOT) ,  Randolph  AFB ,  Texas.  Administrative 
direction  and  oversight  are  provided  to  about  500  TCOs  assigned  throughout 
the  Continental  United  States  and  overseas. 


Test  material  and  data  are  treated  as  confidential  and  are  released  only 
to  authorized  personnel.  Further,  provisions  are  made  to  prevent  test 
compromise  and  to  ensure  scores  are  not  obtained  by  fraudulent  means.  TCOs 
and  TEs  are  fully  advised  of  their  responsibility  for  securing  materials  and 
scores  and  of  potential  punitive  action  that  may  ensue  due  to  test 
compromise  or  loss  of  materials.  All  test  administration  personnel  are 
informed  that  they  are  subject  to  administrative/disciplinary  action  under 
the  Uniform  Code  of  Military  Justice  (UCMJ)  as  specified  in  AFR  30-17,  wnich 
relates  to  test  compromise  or  loss  of  materials.  TCOs  and  TEs  may  not  take 
a  test  which  they  have  administered,  or  to  which  they  have  access,  without 
prior  approval  from  HQ  AFMPC/DPMYO T  or  until  6  months  have  elapsed  since 
access  to  the  test. 


TCOs  and  TEs  are  responsible  for  safeguarding  and/or  retaining  personal 
control  of  test  materials  at  all  times  and  must  adhere  Lo  regulatory 
procedures  for  reporting  any  examinee  misconduct  detected.  Examinee 
activities  that  constitute  test  compromise  are  described  to  examinees  prior 
to  testing. 


Scoring 


AFiXJT  Scoring  Procedure 


Official  scores  used  for  Air  Force  personnel  decisions  are  derived  and 
. eported  by  HQ  A F MFC /DPMYOT  to  recruiters,  detachment  commanders,  and  other 
authorized  users.  Examinees  record  their  test  responses  on  machine-scannable 
answer  sheets.  After  testing,  TEs  forward  the  answer  sheets  from  the 
testing  site  for  processing  at  a  central  facility  maintained  by  HQ 
AFMPC /DPMYOT.  Scoring  accuracy  is  verified  by  periodic  checks  of  the 
calibration  of  t lie  scanning  device,  and  of  associated  scoring  and  reporting 
software,  using  two  procedures:  (a)  processing  of  "dummy"  answer  sheets 
with  known  scores,  and  (b)  hand-scoring  of  examinees'  answer  sheets.  Due 
to  the  proven  accuracy  of  the  computerized  processing  procedures,  requests 
for  hand-scoring  and  rescoring  are  usually  denied.  In  anticipation  of  the 
possibility  that  scores  might  be  challenged,  however,  answer  sheets  are 
retained  for  a  period  of  0  months  after  testing,  along  witli  microfiche 
records  and  tape  files  of  scores  and  item  responses. 
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Quick  Score 


Prescreening  of  applicants  for  OTS  or  the  AFROTC  is  an  undertaking  to 
reduce  the  costs  associated  with  applicant  processing.  Recruiters  can  use 
the  unofficial  "Quick  Score"  estimates  to  make  decisions  about  how  to  manage 
applicants  while  waiting  to  receive  official  AFOQT  results  from  HQ 
AFMPC/DPMYOT.  For  example,  the  applications  of  high-scoring  examinees  who 
are  most  likely  to  meet  Air  Force  aptitude  entry  requirements  may  be 
expedited . 

Several  early  versions  of  a  prescreening  test  were  previously  developed 
for  predicting  AFOQT  scores.  The  Air  Force  Precommissioning  Screening  Test 
(AFPST),  a  screening  test  for  selection  to  navigator  training  and  to  the  Air 
Force  Academy  Preparatory  School,  was  a  short  version  of  the  Officer  Quality 
composite  (now  known  as  the  Academic  Aptitude  composite  of  the  AFOQT) 
(Valentine,  1961).  The  AFPST,  renamed  the  Pre-Enrollment  Test,  was  revised 
and  implemented  in  1965,  and  underwent  further  revision  in  1967  (Miller, 
1966,  1968).  The  Pre- Enrollment  Test  was  discontinued  in  April  1969. 

The  need  for  a  prescreening  device  recurred  in  the  early  1980s  witti  the 
implementation  of  AFOQT  Form  0.  Unlike  the  AFPST  and  the  Pre-Enrollment 
Test,  which  were  separate  and  shorter  versions  of  the  AFOQT,  the  Officer 
Screening  Composites  developed  for  Form  0  used  a  subset  of  tne  items  in  the 
operational  battery  (Rogers,  1985). 

The  Officer  Screening  Composites  were  used  from  July  1982  until  August 
1988,  when  Form  P  became  the  operational  officer  selection  test.  At  that 
time,  HQ  AFMPC/DPMYOT  requested  that  the  applicant  prescreening  procedure  be 
updated.  Research  was  initiated  by  the  Air  Force  Human  Resources  Laboratory 
to  improve  the  prescreening  procedure  for  Form  P. 

The  development  of  the  Form  P  Quick  Score  is  described  at  length  by 
Sperl  (1988)  and  by  Sperl  and  Ree  (1989).  Briefly,  two  tecnniques  for 
selecting  a  subset  of  items  from  the  AFOQT  battery  for  the  QuicK.  Score 
composites  were  compared  for  predictability  of  scores  on  the  full-lengtn 
test  and  for  gender  and  ethnicity  effects.  One  technique  identified  items 
from  each  subtest  by  random  selection;  the  second  identified  items  with  the 
highest  item-subtest  point  biserial  correlations,  a  strategy  similar  to  that 
used  to  construct  the  Form  0  Officer  Screening  Composites. 

Results  favored  the  random  item  selection  methodology.  Although  the  two 
techniques  (random  and  point  biserial)  both  yield  Quick  Score  composites 
which  correlate  well  with  examinees'  scores  on  the  corresponding  composite 
derived  from  the  full-length  test,  the  random  item  selection  technique  snows 
considerably  less  bias  in  prediction  for  gender  and  ethnic  groups.  Another 
improvement  offered  by  the  use  of  the  Form  P  Quick  Score  composites  over  the 
Form  0  Officer  Screening  Composites  concerns  the  representativeness  of  test 
coverage.  Selected  subtests  were  excluded  in  the  development  of  the  Form  0 
prescreening  device;  the  Quick  Score  for  Form  P  represents  all  subtests  in 
proportion  to  their  relative  contributions  to  the  total  item  count  on  the 
full-length  test. 
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Quick  Score  composites  based  on  the  random  item  selection  technique  were 
recommended  for  implementation  to  HQ  AFMPC/DPMYOT.  Additional  analyses  were 
conducted  to  express  or  convert  Quick  Score  composites  to  the  percentile 
metric  used  to  report  results  on  the  five  full-length  AFOQT  composites.  The 
needs  of  recruiters,  who  are  the  primary  users  of  the  Quick  Score  composites, 
were  a  major  consideration.  Recruiters  had  expressed  dissatisfaction  with 
the  Form  0  conversion  method,  which  used  a  confidence  interval  method  to 
establish  the  range  of  percentile  scores  within  wnich  raw  scores  from 
short-form  scoring  were  expected  to  fall  (Rogers,  1985).  The  alternate 
procedure  used  for  Form  P  is  an  expectancy  table.  It  reports  the  percentage 
of  examinees  obtaining  a  given  value  on  the  Quick  Score  composite  who  are 
expected  to  reach  or  exceed  selected  percentiles  on  the  corresponding  AFOQT 
composite.  Manuals  were  also  developed  to  instruct  TEs  on  the  correct 
procedures  for  computing  the  Quick  Score  composite  scores  and  to  guide 
recruiters  in  the  proper  interpretation  of  the  scores  using  the  expectancy 
tables  (Department  of  the  Air  Force,  1989a,  1989b). 

Qualifying  Scores.  In  the  first  stage  of  applying  to  AFROTC  or  OTS ,  all 
examinees  take  all  of  the  AFOQT.  The  minimum  AFOQT  composite  scores  needed 
to  qualify  for  acceptance  are  the  15th  percentile  for  Verbal  (V)  and  the 
10th  percentile  for  Quantitative  (Q).  To  qualify  for  pilot/helicopter 
training,  an  applicant  must  achieve  a  minimum  Pilot  Composite  (P)  score  at 
the  25th  percentile  cad  a  minimum  Navigator- Technical  Composite  (N-T)  score 
at  the  10th  percentile.  Also,  the  examinee's  combined  P  and  N-T  scores  must 
be  at  least  at  the  50th  percentii°  (aFR  51-4).  That  is,  if  only  the  minimum 
score  is  achieved  on  one  of  the  two  composites,  the  applicant  must 
compensate  with  a  higher  score  on  the  other  composite  in  order  to  achieve 
the  5(Jtn  percentile  on  the  combined  scores. 

To  qualify  for  navigator  training,  the  applicant  must  achieve  a  minimum 
N-T  score  corresponding  to  the  25th  percentile,  a  minimum  P  score  at  the 
10th  percentile,  and  a  combined  N-T  and  P  score  at  the  50tn  percentile. 

Reporting  Scores.  Scores  are  reported  to  the  test-taker  verbally  or  in 
the  form  of  a  letter,  or  by  a  copy  of  a  Report  on  Individual  Personnel 
(RIP).  individual  CBPOs  report  scores  to  Active  Duty  Air  Force,  National 
Guard,  and  Air  Force  Reserve  personnel.  The  recruiter  at  a  MEPS  reports 
scores  to  the  civilian  applicants.  The  scores  of  ROTC  cadets  are  sent  to  HQ 
AFROTC. 

Score  [nterpretat  ion.  The  selection  and  commissioning  of  officers  from 
OTS  and  AFROTC  are  directly  affected  by  the  operational  use  of  AFOQT 
scores.  Proper  understanding  and  utilization  ol  these  scores  is  therefore 
of  high  importance.  The  predictive  information  provided  by  AFOQT  scores  is 
combined  with  other  data  (such  as  physical  fitness  and  euucational 
attainment,  among  others)  that  help  to  determine  personnel  decisions.  The 
AFOQT  provides  information  regarding  a  candidate's  aptitudes  for  a  specified 
program.  In  some  programs,  failing  to  achieve  an  established  AFOQT  minimum 
score  can  be  a  sufficient  reason  for  rejecting  a  candidate  (Arth,  1985). 

Extensive  studies  have  determined  the  specific  combinations  of  subtests 
that  maximize  the  prediction  for  certain  types  of  trair iug.  Computer 
scoring  yields  the  five  AFOQT  composites  that  represent  these  subtest 
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combinations.  With  respect  to  interpretation,  Air  Force  personnel  who  use 
the  scores  to  make  selection  decisions  about  officer  applicants  receive  a 
briefing  that  includes  composite  descriptions,  tue  meaning  of  percentiles, 
and  the  score  levels  that  are  considered  competitive.  Test-takers  who 
request  score  interpretation  receive  an  explanation  of  their  percentiles; 
that  is,  where  they  rank  relative  to  all  other  examinees  in  the  normative 
sample  on  the  various  composites. 

Because  of  the  way  in  which  the  AFOQT  is  constructed,  it  is  possible  to 
compare  the  score  differences  of  different  examinees  on  the  same  composite, 
and  those  of  the  same  examinee  on  different  composites.  Care  is  taken  in 
inferring  strengths  and  weaknesses  from  the  score  differences,  however. 
Because  the  score  differences  may  be  a  result  of  chance,  their  proportion  in 
excess  of  chance  must  be  estimated  before  diagnostic  inferences  are  drawn. 


V.  SUPPLEMENTAL  SOURCES  OF  AFOQT  INFORMATION 
Points  of  Contact  for  the  AFOQT 


The  three  major  Air  Force  agencies  involved  on  a  day-to-day  basis  with 
the  AFOQT  testing  program  are  listed  below.  The  functional  responsibilities 
of  the  agencies  differ  as  shown. 

1.  HQ  United  States  Air  Force 
Directorate  of  Personnel  Plans 
Policy  Division 
Washington,  DC  20330 

Develops  plans,  policies,  and  procedures  to  Implement  and  monitor  the  AFOQT. 
Establishes  and  coordinates  testing  requirements  and  procedures  needed  to 
implement  Air  Force  officer  procurement  policies. 

2.  HQ  Air  Force  Military  Personnel  Center  (AFMPC) 

Personnel  Measurement  Division 

Randolph  AFB ,  TX  78150-5000 

Executes  test  policy  and  oversees  the  operational  program  (test  printing, 
distribution,  administration,  scoring,  reporting,  security). 

3.  Air  Force  Human  Resources  Laboratory  (AFHRL) 

Manpower  and  Personnel  Division 

Brooks  AFB,  TX  78235-5601 

Performs  research  to  support  the  operational  test  and  testing  program.  Is 
responsible  for  test  development  and  revision.  Develops,  analyzes,  and 
evaluates  tests  and  associated  materials  (e.g..  Administration  Manual,  Quick 
Score).  Conducts  continuing  research  and  development  to  ensure  the  use  of 
the  AFOQT  and  its  psychometric  adequacy  for  officer  selection  and 
classification  decisions. 


35 


In  addition  to  the  references  cited  in  this  manual,  a  bibliography  of 
selected  references  relative  to  the  AFOQT  is  given  in  Appendix  G.  Most  of 
the  references  were  extracted  from  a  longer  list  of  publications  compiled  by 
Cowan  and  Sperl  (1989).  Their  paper  covers  50  years  (1937-1986)  of  research 
on  selection  and  classification  of  [J.S.  military  officers,  encompassing  many 
topics  in  addition  to  the  AFOQT. 

The  AFOQT  bibliography  is  arranged  chronologically  by  report  date  from 
1950  (about  the  time  that  Form  A  was  introduced)  to  the  present,  providing 
material  of  interest  both  to  those  readers  interested  in  gaining  an 
historical  perspective  on  the  test  and  to  those  concerned  primarily  with  the 
recent  or  current  officer  testing  program.  Most  of  the  reports  focus  on 
AFOQT  development,  standardization,  and  validation  topics. 

Readers  may  obtain  copies  of  the  reports  from: 

(Qualified  requesters — Department  of  Defense  personnel  and  registered 
DTIC  users) 

Defense  Technical  Information  Center  (DTIC) 

Cameron  Station 
Alexandria,  VA  22304-6145 
Commercial  (202)  274-7633 
AUTOVON  284-7633 

(Private  sector  requesters) 

(J.S.  Department  of  Commerce 
National  Technical  Information  Service 
5285  Port  Royal  Road 
Springfield,  VA  22161 
Commercial  (703)  487-4650 
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Table  A-l.  Content  of  AFOQT  Form  M  (1975) 


Subtest3 

Items 

P 

Composites^ 

N-T  OQ  V 

Q 

Booklet  1  (AFPT  972) 

Quantitative  Aptitude 

60 

X 

X 

X 

Booklet  2  (AFPT  973) 

Verbal  Aptitude 

60 

X  X 

Officer  Biographical  Inventory 

96 

X 

Booklet  3  (AFPT  974) 

Scale  Reading  ^ 

48 

X 

Aerial  Landmarks^ 

40 

X 

General  Science 

24 

X 

Booklet  4  (AFPT  975) 

Mechanical  Information 

24 

X 

X 

Mechanical  Principles 

24 

X 

X 

Booklet  5  (AFPT  976) 

Pilot  Biographical  Inventory 

50 

X 

Aviation  Information 

24 

X 

Visualization  of  Maneuvers'^ 

24 

X 

Instrument  Comprehension^  ^ 

24 

X 

Stick  and  Rudder  Orientation 

24 

X 

Total 

522 

aScale  Reading  and  Aerial  Landmarks  are  scored  according  to  the  number 
right  minus  one-fourth  the  number  wrong;  Visualization  of  Maneuvers  and 
Instrument  Comprehension  are  scored  as  the  number  right  minus  one-third  the 
number  wrong.  Other  subtests  are  scored  as  number  right  only. 

^Composites:  Pilot  (P),  Navigator- Technical  (N-T),  Officer  Quality 
(OQ),  Verbal  (V),  Quantitative  (Q). 

cNot  administered  to  female  applicants. 

^Speeded  subtests. 
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Table  A-2.  Content  of  AFOQT  Form  N  (1978) 


Subtesta 


Items 


Composites^5 

P  N-T  OQ  V  Q 


Booklet  1  (AFPT  982) 

Arithmetic  Reasoning  25 

Math  Knowledge  25 

Data  Interpretation  25 

Booklet  2  (AFPT  983) 

Word  Knowledge  25 

Reading  Comprehension  25 

Background  for  Current  Events  25 

Verbal  Analogies  25 

Booklet  3  (AFPT  984) 

Table  ReadingC  50 

Electrical  Maze c  30 

Block  Counting^  80 

Scale  Reading0  48 

Tools  25 

Mechanical  Comprehension  24 

Booklet  4  (AFPT  985) 

Rotated  Blocks  20 

Aerial  Landmarks0  40 

General  Science  24 

Instrument  Comprehension0  24 

Pilot  Biographic  and  Attitude  66 

Scale 

Total  606 


XX  X 

XX  X 

XX  X 


X 


X 

A 

X 

X 


X 

X 

X 

X 


X 

X 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 


X 

X 

X 

X 

X 


instrument  Comprehension  is  scored  according  to  the  number  right  minus 
one-third  the  number  wrong;  the  remaining  speeded  subtests  are  scored  as  the 
number  right  minus  one-fourth  the  number  wrong.  Other  subtests  are  scored 
as  number  right  only. 

^Composites:  Pilot  (P)  ,  Navigator- Technical  (N-T),  Officer  Quality 
(OQ),  Verbal  (V),  Quantitative  (Q) . 

°Speeded  subtests. 
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Table  A-3.  Content  of  AFOQT  Form  0  (1961) 


Composites*3 


Subtest3 

Items 

P 

N-T 

AA 

Verbal  Analogies 

2b 

X 

X 

Arithmetic  Reasoning 

25 

X 

X 

Reading  Comprehension 

25 

X 

Data  Interpretation 

25 

X 

X 

Word  Knowledge 

25 

X 

Math  Knowledge 

2  5 

X 

X 

Mechanical  Comprehension 

20 

X 

X 

Electrical  Maze 

20 

X 

X 

Scale  Reading 

40 

X 

X 

Instrument  Comprehension 

20 

X 

Block  Counting 

20 

X 

X 

Table  Reading 

40 

X 

X 

Aviation  Information 

20 

X 

Rotated  Blocks 

15 

X 

General  Science 

20 

X 

Hidden  Figures 

15 

X 

Total 

360 

aAll  subtests  are  scored  as  number  right  only.  No  subtests  are 
specifically  designated  as  speeded  because  all  subtests  contain  elements  of 

both  power  and  speed.  ,  .  4 

^Composites :  Pilot  (P),  Navigator- Technical  (N-T),  Academic  Aptitude 

(AA),  Verbal  (V),  Quantitative  (Q). 
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PART  1 


Verbal  Analogies 


DIRECTIONS:  This  part  of  the  test  measures  your  ability  to  reason  and  see  rela 
tionships  between  words.  You  are  to  choose  the  answer  that  best  completes  the 
analogy  developed  at  the  beginning  ofeach  question. 


1.  FINGER  is  to  HAND  as  TOOTH  is  to 


1-A 

tongue. 

1-B 

lips. 

1-C 

nose. 

[-[) 

mouth. 

1-E 

molar. 

RACQUET 

is  to  COURT  as 

2-A 

tractor  is  to  Held. 

2-B 

blossom  is  to  bloom 

2-C 

stalk  is  to  prey. 

2-D 

plan  is  to  strategy. 

2-E 

moon  is  to  planet. 

SWEATER  is  to  CLOTHES  as 

3-A 

bottle  is  to  cork. 

3-B 

hand  is  to  finger. 

3-C 

shoe  is  to  foot. 

3-D 

rose  is  to  flowers. 

3-E 

dog  is  to  cat. 

ROW  is  to  BOAT  as  SAIL  is  to 

4-A 

ocean. 

4-B 

navigate. 

4-C 

rudder. 

4-D 

ship. 

4-E 

travel. 

FLY  is  to  AIRPLANE  as 

5-A 

drive  is  to  stake. 

5-B 

skate  is  to  slide. 

5-C 

push  is  to  fall 

5-D 

swim  is  to  float. 

f)-E 

rod  is  to  hook 
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PART  2 

Arithmetic  Reasoning 

1)1  RUCTIONS:  This  part  of  the  test  measures  mathematical  reasoning.  It  is 
concerned  with  your  ability  to  arrive  at  solutions  to  problems.  Each  problem  is 
followed  by  five  possible  answers.  Decide  which  one  of  the  five  answers  is  most 
nearly  correct. 


1.  A  field  with  an  area  of  420  square 
yards  is  twice  as  large  in  area  as  a 
second  field.  If  the  second  field  is  !5 
yards  long,  how  wide  is  it'.’ 


1-A 

7  yards 

1-B 

14  yards 

l-C 

28  yards 

1-D 

56  yards 

1-E 

90  yards 

4.  An  Air  Force  recruiting  station 
enlisted  450  people.  Of  these.  2S% 
were  under  20  years  old  and  32**- 
were  20  to  22  years  old.  How  man  v  of 
the  recruits  were  over  22  vears  old  ’ 


4-A 

130 

4-B 

140 

4-C 

175 

4-D 

180 

4-E 

270 

4..  A  passenger  plane  can  carry  two  tons 
of  cargo.  A  freight  plane  can  carry 
five  tons  of  cargo.  If  an  equal  number 
of  both  kinds  of  planes  are  used  to 
ship  105  tons  of  cargo  and  each  plane 
carries  its  maximum  cargo  load,  how 
many  tons  of  cargo  are  shipped  on  the 
passenger  planes’ 


2-A 

1 5  0  tons 

2-B 

30.0  tons 

2-C 

42.0  tons 

2-D 

52  5  tons 

2-E 

75.0  tons 

5.  If  an  aircraft  travels  at  564  miles  per 
hour,  how  far  did  the  aircraft  fly  in 
900  seconds0 


5-A 

141  miles 

5-B 

226  miles 

5-C 

300  miles 

5-D 

451  miles 

5-E 

846  miles 

3.  A  typist  took  three  typing  tests.  The 
average  typing  speed  on  these  three 
tests  was  48  words  per  minute.  If  the 
typist's  speed  on  two  of  these  tests 
was  52  words  per  minute,  what  was 
the  typist's  speed  on  the  third  test0 


3-A 

46  words  per  minute 

3-B 

44  words  per  minute 

3-C 

42  words  per  minute 

3-D 

40  word',  per  minute 

3-E 

in  W'  rd'  per  minute 
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PART  3 


Reading  Comprehension 

I ) I  It  Kl'  I  IONS:  Ibis  part  of  Lh>-  tost  measures  your  abi li ty  to  read  and  understand 
I'.n  •»•»*;« |ih.->  for  each  i|itosti<>n.  yui  are  to  choose  the  answer  that  best  completes  the 
me  mi np  •■!  the  paragraph 

]  If  tiles  ,iie  t"  ! u  :e  te  n  e fleet i vel v.  organizations.  like  other  -.v slem>.  must  achieve 
a  natural  harmony  er  coherence  among  their  component  parts.  The  structural 
and  situational  elements  of  an  effective  urbanization  form  themselves  into  a 
tightly  knit,  highly  cubes  ve  package.  An  organization  whose  parts  are 
mismatched,  however,  cannot  carry  out  its  missions.  If  managers  are  to  design 
effective  organizations,  they  need  to 

I  -A  simphly  organizational  structures 
1  -li  encourage  greater  specialization  oi  labor. 

1  emphasize  the  fit  of  organizational  parts. 

1  -1 )  int roduee  more  technological  innovations 

1-K  reduce  the  span  of  control  in  the  organization. 


first.  Cf  stridium  Botulinum,  the  bacterium  that  produces  the  poison,  must  be 
present.  These  bacteria  are  widespread  in  the  environment  and  are  considered  by 
some  to  be  everywhere,  in  fact.  Second,  the  bacterium  that  produces  the  deadly 
toxin  must  be  treated  to  an  atmosphere  that’s  free  of  oxygen  and  to  temperatures 
that  are  ju.st  warm  enough  but  not  too  warm.  Those  conditions  have  to  be  held 
long  enough  lor  the  toxin  to  develop.  Acid  will  prevent  the  growth  of  the 
orgam  an  and  the  production  of  toxin  The  following  condition  is  necessary  for 


Botulism 

to  develop 

J-A 

presence  of  oxygen . 

J-B 

a  briel  period  of  time 

1-1 ' 

1  resenee  of dic'd 

d-i) 

warm  temperatures 

J-K 

exposure  to  lare  bacteria 

i.  Due  to  our  short  life  span  of  seventy  odd  years  it  is  easy  for  human  beings  to 
think  ol  earth  as  a  planet  which  never  changes  Yet  we  live  on  a  dynamic  planet 
with  many  factors  contributing  to  charge.  We  know  that  wind  and  rain  erode 
and  shape  our  planet.  Many  other  forces  are  also  at  work,  such  as  volcanic 
activity  temperature  flu*  '.nations,  and  even  extraterrestrial  interaction  such  as 


meteors  ; 

and  grav  i tationa  1  lorces.  The  earth,  in  actuality,  is  a 

large  rock 

t-A 

in  . i 

st, til 

■  of  i  nertia 

lit 

whh 

i  h  is  i 

ijun  k  ly  eroding 

whu 

i'll  is 

i-V'il’vin^ 

1) 

Mill: 

■  1.  i  . 

>uhjr«  t  b » 

temperature  fluctuations  cause 

d  by  inlcrpl 

ilit»- 

1  nil 

■  in 

if 

wlii< 

h  1 . 

..ihj-M  1  | . .  1 

wind  caused  by  meteor  activity 
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4.  Mustela  nigripes.  the  rarely  seen  black-footed  ferret,  is  often  confused  with 
Mustela  putorius,  the  common  European  polecat.  It  is  true  that  these  two 
mammals  resemble  each  other  in  some  ways.  However,  they  are  two  distinct  and 
separate  species  with  differences  in  color,  body  form,  and  other  attributes.  Who 
knows  how  many  sightings  of  the  black-footed  ferret 

4-A  were  the  result  of  seeing  the  European  polecat  running  loose? 

4-B  were  of  species  other  than  the  common  European  polecat? 

4-C  were  made  of  a  related  species  of  the  same  form  and  color? 

4-D  were  instead  sightings  cf  the  Mustela  nigripes? 

4- E  were  due  to  the  European  polecat  destroying  their  habitat? 

5.  One  theory  that  explains  the  similarities  between  Mayan  art  and  ancient 
Chinese  art  is  called  "diffusion."  This  theory  evolves  from  the  belief  that 
invention  is  so  unique  that  it  happens  only  once,  then  is  "diffused”  to  other 
cultures  through  travel,  trade,  and  war.  This  theory  might  explain  why 

5- A  the  airplane  and  birds  both  have  wings. 

5-B  certain  artifacts  in  Central  America  resemble  those  found  in  Southeast 
Asia. 

5-C  most  great  art  comes  from  Europe,  where  there  is  much  travel  between 
countries. 

5-D  rivers  in  South  America  and  Africa  have  some  similar  features. 

5-E  England,  being  so  remote  in  the  Middle  Ages,  is  the  only  country  to 
have  castles. 
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PART  4 


Data  Interpretation 


l)i  KKCTIONS:  This  part  of  the  test  measures  your  ability  to  interpret  data  from 
tables  and  graphs.  Each  table  and  graph  is  followed  by  two.  three,  or  four  questions 
pertaining  t..  that  table  or  graph  only. 


Ration  A 

Ration  B 

Ration  C 

Ration  D 

4  5 

2.8 

3.5 

6.2 

6  5 

3.6 

7.6 

4.6 

3  3 

4.9 

5.3 

5.2 

4.9 

7.1 

5.5 

6.1 

5.8 

3.7 

4.1 

7.0 

2.5 

5.1 

6.7 

3.9 

4.2 

6.0 

4.5 

5.9 

4.7 

4.7 

6.5 

7.1 

3.8 

6.1 

5.7 

5.9 

53) 

5.9 

tU 

45.5 

49.0 

55.3 

60.0 

Four  groups  of  ten  rabbits  were  each  fed  a  complete  commercial  ration  from  birth 
until  5  months  of  age.  Each  animal  was  weighed  at  birth  and  at  five  months.  The 
weight  of  the  animal  at  birth  was  subtracted  from  its  weight  at  five  months.  These 
differences,  in  pounds,  are  shown  above  for  each  rabbit. 


1.  Which  ration  appears  to  be  the  best? 


1-A 

Ration  A 

1-B 

Ration  B 

1-C 

Ration  C 

l-D 

Ration  D 

2.  The  average  weight  gain  of  the  rabbits  fed  Ration  B  is 


2-A 

4.55  lbs 

2-B 

4.90  lbs 

2-C 

5.53  lbs 

2-D 

5.44  lbs 

2-E 

6.00  lbs 
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1  l  he  :,Vl‘r  ,a>'  (,i!Tr  rf|u'r  in  wt-Uht  gain  between  the  group  of  rabbit.-,  who  trained 
the  most  and  the  group  who  gained  the  least  was 


1- A 

A. do  lbs. 

d-B 

u  47  lbs. 

d-C 

||  47  1 8s. 

d-D 

1  10  lbs. 

d-E 

1  4  7  lbs. 

4  I  he  individual  rabbit  showing  the  least  gain  was  fed 

4-A  Ration  A 

4-B  Ration  B 

4-C  Ration  C. 

4-U  Ration  L) 


)  I  he  individual  rabbit  that  gained  the  most  weight  gained 

b-A  ij.D  lbs 

b-B  b  .5  lbs 

b-C  7.1  lbs 

b-D  7.  bibs. 

5-E  8.1  lbs. 
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I’AKT  5 


Word  Knowledge 


1)1  KKC  I  IONS:  I  his  pari  < it  the  tost  measures  verbal  comprehension  involving  your 
i K 1 1 1 1 \  t  ■  understand  written  lanju  140.  for  eai  h  question.  choose  the  answer  that 
nit  .in  -  iho  same  as  the  capitalized  word 


CRIMSON 

1  -  A 

crisp 

1-li 

neatly  dressed 

1-C 

n*ddi  sh 

l-l) 

'.■'liortul 

1-K 

lively 

CRANK 

J-A 

Start 

2-H 

'.'hane,. 

2-C 

ci. nti  hue 

2-1) 

stop 

2- K 

fold 

FOR  IMG 

1 1 

l-A 

two  days 

i-R 

one  Week 

i-C 

two  Weeks 

!-!) 

one  month 

i-K 

two  months 

Si  I.I.KN 

4- A 

4i'.a vi sh  vellow 

4-H 

cOak  i  w  et 

4-C 

very  dirty 

4-1) 

angrily  silent 

4-K 

mildly  nauseat 

I'KKSK 

‘.-A 

pointed 

»-li 

trivial 

oral 

Ml) 

lengthy 

'.-K 

•"  '.a.'  i !  v 

52 


PART  6 


Math  Knowledge 


DIKKCTIONS:  This  part  of  the  test  measures  your  ability  to  use  learned 
mathematical  relationships.  Kach  problem  is  followed  by  five  possible  answers. 
Decide  which  one  ol  the  live  answers  is  most  nearly  correct. 

1  l'he  lirst  dipit  of  the  square  root  of  59043  is 

I -A  1 

1-1!  2 

1  -C  3 

1-1)  4 

1-K  ') 


2  The  distance  in  miles  around  a  circular  course  with  a  radius  of  35  miles  is  (use 
pi  =  22/7) 

2- A 

2-H 

2-C 

2-1) 

2-K 

3  The  expression  "3  factorial"  equals 


1  10  mi  les. 
\ r>b  miles. 
220  miles. 
440  mi  les. 
880  miles. 


3-A 

3-B 

3-C 

3-1) 

3- K 

4  Solve  tor  x.  2x  7  -  2x'-' 

4- A 

4-B 

4-C 

4-1) 

4- K 

5.  The  reciprocal  of  5  is 

5-  A  0.1. 

5-11  0.2. 

5-C  0.5. 

5-1)  1.0. 

5-K  2.0. 


1  7 

2  7 
2 
7 

14 


1  9 
1  6 

H 

9 
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PART  7 


Mechanical  Comprehension 

1)1  RUCTIONS:  This  part  of  the  test  measures  your  ability  to  learn  and  reason  with 
mechanical  terms.  Choose  the  answer  that  best  completes  the  statement.  Also 
included  in  this  part  of  the  test  are  diagrams  of  mechanical  devices.  Following  each 
diagram  are  several  questions  or  incomplete  statements.  Study  the  diagram 
carefully  and  select  the  choice  that  best  answers  the  question  or  completes  the 
statement. 


L  If  gear  R  is  the  driver,  at  the  moment  shown, 
gear  S  is 

1-A  not  moving. 

l-B  jammed. 

1-C  moving  at  its  highest  speed. 

1-D  moving  in  the  same  direction  as 

gear  R. 

1-E  moving  in  the  opposite  direction 

as  gear  R. 


3.  As  shaft  S  makes  one  complete  turn  from  the  posi¬ 
tion  shown,  C  moves 

3-A  left  and  then  right. 

3-B  right  and  then  left. 

3-C  up  only. 

3-D  down  only. 

3-E  up  and  down. 
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B  *  25  pounds 


Smooth 

Surface 


Smooth 

Surface 


(NOT  IN  equilibrium) 


4.  If  weight  B  is  to  slide  to  the  right,  what  change  must  be  made  in  the  diagram? 


4-A  The  slope  of  the  inclined  plane  under  A  must  be  increased. 

4-B  The  slope  of  the  inclined  plane  under  B  must  be  increased. 

4-C  The  radius  of  the  inner  pulley  must  be  decreased. 

4-D  The  radius  of  the  inner  pulley  must  be  increased  to  a  size  nearer  to 

that  of  the  outer  pulley. 

4-E  The  radius  of  the  outer  pulley  must  be  twice  that  of  the  inner  pulley. 


5  Ten-pound  weights  are  each  suspended  from  a  ceiling  by  three  identical  springs. 
In  A.  the  extension  of  each  spring  is 


5-A 

nine  times  greater  than  in  B 

5-B 

three  times  greater  than  in  B 

5-C 

the  same  as  in  B 

5-D 

1/3  less  than  in  B 

5-E 

19  less  than  in  B 
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PARTS 

Electrical  Maze 


DIKKCTIONS:  This  is  a  test  of  your  ability  to  choose  a  correct  path  from  among 
several  choices.  In  the  picture  below  is  a  box  with  dots  marked  S  and  F.  S  is  the 
starting  point,  and  F  is  the  finishing  point.  You  are  to  follow  the  line  from  S. 
through  the  circle  at  the  top  of  the  picture,  and  back  to  F. 


[n  the  problems  in  tms  test,  there  will  be  Five  such  boxes.  Only  one  box  will  have 
a  line  from  the  S,  through  the  circle,  and  back  to  the  F  in  the  same  box.  Dots  on  the 
lines  show  the  only  places  where  turns  or  direction  changes  can  be  made  between 
lines.  If  lines  meet  or  cross  where  there  is  no  dot,  turns  or  direction  changes  cannot 
be  made.  Now  try  sample  problem  SI. 


SI. 


A  B  C  D  E 


The  first  box  is  the  one  which  has  the  line  from  S,  through  the  circle,  and  back  to  F. 
Therefore,  A  is  the  right  answer. 
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Each  diagram  in  the  test  has  only  one  box  which  has  a  line  through  the  circle  and 
back  to  F.  Some  lines  are  wrong  because  they  lead  to  a  dead  end.  Some  lines  are 
wrong  because  they  come  back  to  the  box  without  going  through  the  circle.  Some 
lines  are  wrong  because  they  lead  to  other  boxes.  Some  are  wrong  because  they 
retrace  the  same  line. 

Now  try  sample  problems  S2  and  S3. 


S2.  SJ. 


ABCDE  ABCDE 


For  sample  problem  S2,  the  correct  answer  is  D.  For  sample  problem  S3,  the  correct 
answer  is  B. 


ABCDE  ABCDE 
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PART  9 


Scale  Reading 

DIRECTIONS:  This  is  a  test  of  your  ability  to  read  scales,  dials,  and  meters.  You 
will  be  given  a  variety  of  scales  with  various  points  indicated  on  them  by  numbered 
arrows.  You  are  to  estimate  the  numerical  value  indicated  by  each  arrow,  find  the 
choice  closest  to  this  value  in  the  item  having  the  same  numbers  as  the  arrow,  and 
then  mark  your  answer  on  a  separate  sheet  of  paper.  .Mow  look  at  the  sample  items 
below. 


SI 


0 

20 

25 

0 

llillliml 

S2 


SI 

1 1 1 

■4-*  1 

60 

90 

0 

_L _ , _ 

3 

I.  .  1.1 

S4 

51.  A  6.00 
B  5.00 
C  4.25 
I)  2.25 
E  1.25 

52.  A  130 
H  12.0 
C  10.2 

D  1.3 
E  1.2 

53.  A  81.75 
R  79.5 
C  78.75 
I)  77.60 
E  67.50 

54.  A  1.75 
B  1.65 
C  1.50 

I)  75 

E  .65 


In  sample  item  SI  there  are  five  subdivisions  of  four  steps  each  between  0  and  20. 
The  arrow  points  between  the  long  subdivision  markers  representing  4  and  8.  Since 
it  points  to  the  marker  that  is  one  step  to  the  right  of  subdivision  marker  4.  it  points 
to  5.00.  This  is  choice  B  in  sample  item  Si. 


In  sample  item  S2  the  scale  runs  from  right  to  left.  There  are  five  subdivisions  of 
five  steps  each,  so  each  step  represents  .1.  and  the  arrow  points  to  the  marker 
representing  1.2.  This  is  choice  E  in  sample  item  S2. 

In  sample  item  S3  the  arrow  points  between  two  markers.  You  must  estimate  the 
fractional  part  of  the  step  as  accurately  as  possible.  Since  the  arrow  points  halfway 
between  the  markers  representing  77.5  and  80.0.  it  points  to  78.75.  This  is  choice  C 
in  sample  item  S3. 


In  sample  item  S4  each  step  represents  5.  but  the  steps  are  of  unequal  width  with 
each  step  being  two-thirds  as  wide  as  the  preceding  one.  Therefore,  the  scale  is 
compressed  as  the  values  increase  The  arrow  is  pointing  to  a  position  halfway 
between  the  marker  representing  5  and  10,  but  because  of  the  compression  of  the 
scale  the  value  ot  this  p-ant  mu  ,i  be  less  than  75.  Actually,  it  is  .65.  which  is  choice 
E  m  sample  item  54 
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PART  10 


Instrument  Comprehension 


DIRECTIONS:  This  test  measures  your  ability  to  determine  the  position  of  an 
airplane  in  flight  from  reading  instruments  showing  its  compass  heading,  its 
amount  of  cl'mb  or  dive,  and  its  degree  of  hank  to  right  or  left.  In  each  item  the  left- 
hand  dial  is  labeled  ARTIFICIAL  HORIZON.  On  the  face  of  this  dial  the  small 
aircraft  silhouette  remains  stationary,  while  the  positions  of  the  heavy  black  line 
and  the  black  pointer  vary  with  changes  in  the  position  of  the  airplane  in  which  the 
instrument  is  located. 

The  heavy  black  line  represents  the  HORIZON  LINE,  and  the  black  pointer 
shows  the  degree  of  BANK  to  right  or  left. 


If  the  airplane  is  neither 
climbing  nor  diving,  the 
horizon  line  is  directly 
on  the  silhouette's  fuse¬ 
lage,  as  in  dial  1,  below. 


HORIZON 

Dial  1 

If  the  airplane  has  no 
bank,  the  black  pointer 
is  seen  to  point  to  zero,  as 
in  dial  l,  above. 


If  the  airplane  is  climbing, 
the  fuselage  silhouette  is 
seen  between  the  horizon 
line  and  the  pointer,  as  in 
dial  2,  below.  The  greater 
the  amount  of  climb,  the 
greater  the  distance  be 
tween  the  horizon  line  and 
the  fuselage  silhouette. 


HORIZON 

Dial  2 

If  the  airplane  is  banked  to 
the  pilot’s  right,  the 
pointer  is  seen  to  the  left  of 
zero,  as  in  dial  2,  above. 


If  the  airplane  is  diving, 
the  horizon  line  is  seen 
between  the  fuselage 
silhouette  and  the  point 
er,  as  in  dial  3,  below. 
The  greater  the  amount 
of  dive,  the  greater  the 
distance  between  the 
horizon  line  and  the 
fuselage  silhouette. 


HORIZON 

Dial  J 

If  the  airplane  is  banked 
to  the  pilot's  left,  the 
pointer  is  seen  to  the 
right  of  zero,  as  in  dial  3. 
above. 

is  always  at  right  angles  to 


The  HORIZON  LINE  tilts  as  the  aircraft  is  banked  and 
the  pointer. 

Dial  1  above  shows  an  airplane  neither  climbing  nor  diving,  with  no  bank. 

Dial  2  above  shows  an  airplane  climbing  and  banking  45  degrees  to  the  pilot's 
right. 

Dial  3  above  shows  an  airplane  diving  and  banked  45  degrees  to  the  pilot's  left. 
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On  each  item  the  right  hand  dial  is  labeled  COMPASS.  On  this  dial,  the  arrow 
shows  the  compass  direction  in  which  the  airplane  is  headed  at  the  moment.  Dial  4 
shows  it  headed  north;  dial  5  shows  it  headed  west;  and  dial  6  shows  it  headed 
northwest 


Each  item  in  this  test  consists  of  two  dials  and  four  silhouettes  of  airplanes  in 
flight.  Your  task  is  to  determine  which  one  of  the  four  airplanes  is  MOST  NEARLY 
in  the  position  indicated  by  the  two  dials.  YOU  ARE  ALWAYS  LOOKING  NORTH 
AT  THE  SAME  ALTITUDE  AS  EACH  OF  THE  PLANES.  EAST  IS  ALWAYS  TO 
YOUR  RIGHT  AS  YOU  LOOK  AT  THE  PAGE.  Item  X  is  a  sample.  In  item  X  the 
dial  labeled  ARTIFICIAL  HORIZON  shows  that  the  airplane  is  NOT  banked,  and  is 
neither  climbing  nor  diving  The  COMPASS  shows  that  it  is  headed  southeast.  The 
only  one  of  the  four  airplane  silhouettes  that  meets  these  specifications  is  in  the  box 
lettered  C,  so  the  answer  to  X  is  C.  Note  that  B  is  a  rear  view,  while  D  is  a  front  view. 
Note  also  that  A  is  banked  to  the  right  and  that  B  is  banked  to  the  left. 


A 

^  XT 

B 

C 

D 
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ARTIFICIAL 

HORIZON 


COMPASS 


A 

B 

Ss 

V 

A 

V 

B 

C 

'V 

0 

X 

A 


B 
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PART  11 

Block  Counting 

DIRECTIONS:  This  is  a  test  of  your  ability  to  "see  into"  a  3-dimensional  pile  of 
blocks  and  determine  how  many  pieces  are  touched  by  certain  numbered  blocks.  All 
of  the  blocks  tn  each  pile  are  the  same  sue  and  shape  Look  at  the  sample  below. 


|  KEY 

Biocfc 

A 

B 

C 

D 

E 

SI 

1 

2 

3 

4 

S 

S2 

3 

4 

s 

b 

7 

S3 

S 

6 

7 

8 

9 

S4 

2 

3 

.. 

4 

3 

6 

ss 

2 

3  1  4 
_ 1 _ 

S 

6 

Block  Si  touches  the  other  2  top  blocks  and  the  2  blocks  directly  below  it.  The 
total  number  of  blocks  touched  by  SI  is,  therefore,  4.  For  sample  problem  SI.  4  is 
choice  D  in  the  key  to  the  right. 

Block  S2  touches  blocks  1  and  3  and  the  unlettered  block  to  the  right  of  block  3. 
Since  block  S2  touches  3  other  blocks,  the  answer  is  3.  According  to  the  key,  3  is 
choice  A  for  sample  problem  S2.  Now  look  at  sample  problem  S3.  It  touches  3  blocks 
above,  3  blocks  below,  and  one  block  on  the  right.  Therefore,  the  correct  answer  is  7, 
so  C  is  the  correct  answer  to  sample  problem  S3. 

Now  count  the  blocks  touching  blocks  S4  and  S5.  For  block  S4,  the  correct  answer 
is  5,  so  D  would  be  the  correct  answer.  For  block  S5,  the  correct  answer  is  4,  so  C 
would  be  the  correct  answer 
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PART  12 

Table  Reading 


1)1  RUCTIONS:  I'hts  is  a  test  of  your  ability  to  read  tables  quickly  and  accurately. 
L.mk  at  the  table  below.  Notice  that  the  X  values  are  shown  at  the  top  of  the  table 
and  the  V  values  are  -hown  on  the  left  of  the  table  In  this  test,  you  are  to  find  the 
entry  that  occurs  at  the  intersection  of  the  row  and  the  column  corresponding  to  the 
values  given. 


X  \  \I.LK 


Y  VALLE 


-i 

-1 

•> 

♦  1 

4  2 

-3 

<- 5 

.2 

21 

25 

* 

25 

29 

10 

A.  ) 

23 

25 

27 

29 

30 

ii 

J2 

-  1 

24 

r 

2fi 

23 

30 

)2 

ii 

34 

0 

26 

27 

29 

31 

33 

U 

35 

-l 

29 

10 

32 

34 

35 

37 

_> 

23 

10 

11 

n 

Q 

fl 

m 

3 

3. 

12 

u 

n 

IT 
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_ 

X 

Y 

A 

B 

C 

i> 

E 

i 

0 

-1 

29 

33 

32 

35 

34 

o 

3 

-3 

22 

29 

23 

31 

28 

0 


HART  13 


Aviation  Information 


1)1  KKCTIONS:  l'h is  part  of  the  test  measures  your  knowledge  of  aviation.  Each  of 
the  question*  |jr  incomplete  statements  is  followed  bv  five  choices.  Decide  which  one 
of  the  choices  best  answers  the  question  or  completes  the  statement. 

1  If  the  elevator  tabs  on  a  plane  are  lowered,  the  plane  will  tend  to 

1-A  nose  up. 

1 -B  nose  down. 

1-C  pitch  fore  and  aft. 

1-D  go  into  a  slow  roll. 

1- K  wing  over. 

2  The  pilot  always  advances  the  throttle  during  a 

2- A  nosedive 

2~B  landing. 

2-C  turn. 

2-D  spin. 

2- K  climb 

3.  The  pilot  of  an  airplane  can  best  detect  the  approach  of  a  stall  by  the 

3- A  increase  in  speed  of  the  engine. 

3-B  increase  in  pitch  and  intensity  of  the  sound  of  the  air  moving  past 

the  plane. 

3-C  increase  in  effectiveness  of  the  rudder. 

3-D  ineffectivenessof  the  ailerons  and  elevator. 

3- E  decrease  in  pitch  and  intensity  of  the  sound  of  the  air  moving  past 

the  plane. 

4.  It  is  ordinarily  desirable  to  provide  an  unusually  long  flight  strip  at  municipal 
airports  for  the  takeoff  of 

4- A  military  planes  in  echelon. 

4-B  heavily  loaded  ships  in  still  air. 

4-C  small  airplanes  in  rainy  weather. 

4-D  any  airplane  across  the  wind. 

4- E  airplanes  that  have  high  climbing  speeds. 

5.  The  slipstream  of  an  airplane  will  have  the  least  effect  on  the  plane  s  direction  of 
flight  when  the  plane's 

5- A  stick  is  moved  forward. 

b-B  stick  is  moved  backward. 

5-C  stick  is  moved  sideways. 

5-D  rudder  bar  is  pressed  with  the  left  foot. 

5-E  rudder  bar  is  pressed  with  the  right  foot. 
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PART  14 

Rotated  Blocks 

1)1  KKCTIONS:  This  test  measures  your  ability  to  visualize  and  manipulate  objects  in  space.  In  each  item  of  this 
test  you  are  shown  a  picture  of  a  block.  The  problem  is  to  find  a  second  block  which  isjusl  like  the  first. 

Luuk  at  the  two  blocks  below.  Although  you  see  them  from  different  points,  the  blocks  are  just  alike. 

Look  at  the  two  blocks  below.  They  are  not  alike.  They  can  never  be  turned  so  that  they  will  bo  alike. 


Now  look  at  the  sample  item  below.  Which  of  the  five  choices  is  just  like  the  first  block? 


The  correct  answer  is  L).  It  is  the  same  block  as  seen  from  a  different  side. 
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Now  look  at  the  two  sample  items  below. 


The  right  answer  for  S2  is  C.  The  right  answer  for  S3  is  A. 
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PART  15 


General  Science 

IMKKCTIONS:  This  part  of  the  test  measures  your  knowledge  in  the  arena  of 
science.  Kach  of  the  questions  or  incomplete  statements  is  followed  by  five  choices. 
Decide  which  one  of  the  choices  best  answers  the  question  or  completes  the 
statement. 


An  eclipse 

1  f  the  sun  throws  th 

i-\ 

m  .i'll  m  the  son 

1-B 

earth  on  the  sun 

1-C 

sun  on  the  earth. 

1-D 

earth  on  the  moon. 

1-E 

moon  on  the  earth. 

2.  Substances  which  hasten  a  chemical  reaction  without  themselves  undergoing 
change  are  called 

2-A  buffers. 

2-R  catalysts. 

2-C  colloids. 

2-D  reducers. 

2- E  polymers. 

3.  Lack  of  iodine  is  often  related  to  which  of  the  following  diseases? 

3- A  Beriberi 

3-B  Scurvy 

3-C  Rickets 

3-D  Goiter 

3- E  Asthma 

4.  The  chief  nutrient  in  lean  meal  is 

4- A  starch. 

4-B  protein. 

4-C  fat. 

4-U  carbohydrates. 

4- E  Vitamin  B. 

">.  After  adding  salt  to  water,  the  freezing  point  of  the  water  is 

5- A  variable. 

5-B  inverted. 

5-C  the  same 

o-D  raised 

5-K  lowered. 
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PART  16 


Hidden  Figures 

DIRECTIONS:  This  part  of  the  test  measures  your  ability  to  see  a  simple  figure  in 
a  complex  drawing.  At  the  top  of  each  page  are  five  figures,  lettered  A,  B,  C,  D,  and 
E.  Below  these  on  each  page  are  several  numbered  drawings.  You  are  to  determine 
which  lettered  figure  is  contained  in  each  of  the  numbered  drawings. 

The  lettered  figures  are: 


As  an  example,  look  at  drawing  X  below. 


X  Y 


Which  one  of  the  five  figures  is  contained  in  drawing  X? 

Now  look  at  drawing  Y,  which  is  exactly  like  drawing  X  except  that  the  outline  of 
figure  B  has  been  made  heavy  to  show  where  to  look  for  it.  Thus,  B  is  the  answer  to 
sample  item  X. 

Each  numbered  drawing  contains  only  one  of  the  lettered  figures.  The  correct  figure 
in  each  drawing  will  always  be  of  the  same  size  and  in  the  same  position  as  itappears 
at  the  top  of  the  page  Therefore,  do  not  rotate  the  page  in  order  to  find  it.  Look  at 
each  numbered  drawing  and  <le<  ule  which  one  of  the  five  lettered  figures  is  contained 
in  it. 
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Sample  Item  Key 


Verbal  Analogies 

TT  D" 

2.  A 

3.  D 

4.  D 

5.  A 

Arithmetic  Reasoning 

TT  b 

2.  B 

3.  D 

4.  D 

5.  A 

Reading  Comprehension 

TT  C 

2.  D 

3.  C 

4.  A 

5.  B 

Data  Interpretation 

1.  D 

2.  B 

3.  E 

4.  A 

5.  E 

Word  Knowledge 

1.  C 

2.  D 

3.  C 

4.  D 

5.  A 

Math  Knowledge 

1.  B 

2.  C 

3.  C 

4.  A 

5.  B 

Me c h anical  Comprehension 

1 .  A 

2.  B 

3.  E 

4.  C 

5.  B 

Electrical  Maze 

1.  B 

2.  C 

3 .  E 

4.  A 

3.  D 


Scale  Reading 

TT  A 

2.  E 

3.  B 

4.  C 

5.  D 

Instrument  Comprehension 

TT  D 

2.  D 

3.  C 

4.  A 

5.  B 

Block  Counting 

TT — D 

2.  B 

3.  A 

4.  E 

5.  C 

Table  Reading 

1.  C 

2.  B 

3.  E 

4.  C 

5.  A 


Aviation  Information 


3.  D 

4.  B 

5.  C 

Rotated  Blocks 
TT  D 

2.  C 

3.  C 

4.  D 

5.  B 

General  Science 

1 .  E 

2.  B 

3.  D 

4.  B 

3 .  E 

Hidden  Figures 

1 .  A 

2.  A 

3.  E 

4.  D 

3.  C 
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Subtest  and  Composite  Raw  Scores 


77 


Table  C— 1.  Skew  and  Kurtosis  of  AFOQT  Form  P 
Subtest  and  Composite  Raw  Scores 


Subtes  t 

Number 
of  items 
scored 

71 — 

Skew 

- FT 

Kurtosis 

7i - rr 

Verbal  Analogies 

25 

-.41 

-.58 

-.32 

-.07 

Arithmetic  Reasoning 

25 

-.25 

-.20 

-.83 

-.81 

Reading  Comprehension 

25 

-.24 

-.39 

-.57 

-.64 

Data  Interpretation 

25 

-.27 

-.42 

-.57 

-.56 

Word  Knowledge 

25 

-.18 

-.12 

-.90 

-.98 

Math  Knowledge 

25 

-.42 

-.14 

-.99 

-1.10 

Mechanical  Comprehension 

20 

.11 

.09 

-.72 

-.73 

Electrical  Maze 

20 

.46 

.37 

.08 

.01 

Scale  Reading 

40 

-.09 

-.09 

-.58 

-.32 

Instrument  Comprehension 

20 

-.13 

-.25 

-1.13 

-1.155 

Block  Counting 

20 

-.43 

-.28 

-.35 

-.37 

Table  Reading 

40 

-.63 

-.40 

.43 

.10 

Aviation  Information 

10 

.54 

.54 

-.54 

-.54 

Rotated  Blocks 

15 

-.07 

-.23 

-.62 

-.73 

General  Science 

19 

.17 

.21 

-.72 

-.64 

Hidden  Figures 

15 

-.20 

-.24 

-.71 

-.51 

Composite 

Pilot 

204 

-.27 

-.34 

-.40 

-.30 

Navigator- Technical 

264 

-.32 

-.31 

-.4U 

-.39 

Academic  Aptitude 

150 

-.35 

-.34 

-.50 

-.56 

Verbal 

75 

-.27 

-.35 

-.58 

-.b2 

Quantitative 

75 

-.32 

-.24 

-.74 

-.85 
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APPENDIX  D:  Item  Analysis  Summary 
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Table  D-l.  Mean  Difficulties,  Biserials,  and  Point 
"  Biserials,  AFOQT  Form  Pi 


Subtest 

Mean  item 
difficulty 

Mean 

biserial 

Mean  point 
biserial 

Verbal  Analogies 

(VA) 

.648 

.573 

.410 

Arithmetic  Reasoning 

(AR) 

.619 

.667 

.500 

Reading  Comprehension 

(KC) 

.607 

.578 

•  43b 

Data  Interpretation 

(DI) 

.640 

.517 

.390 

Word  Knowledge 

(WK) 

.606 

.662 

.500 

Math  Knowledge 

(MK) 

.665 

.722 

.544 

Mechanical  Comprehension 

(MC) 

.496 

.545 

.429 

Electrical  Maze 

(EM) 

.386 

.549 

.411 

Scale  Reading 

(SR) 

.595 

.509 

.379 

Instrument  Comprehension 

(IC) 

.576 

.700 

.546 

Block  Counting 

(BC) 

.640 

.663 

.488 

Table  Reading 

(TR) 

.690 

.683 

.456 

Aviation  Information 

(AI) 

.447 

.595 

.461 

Rotated  Blocks 

(RB) 

.533 

.631 

.470 

General  Science 

(GS) 

.491 

.560 

.434 

Hidden  Figures 

(HF) 

.666 

.659 

.474 
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Table  D  2.  Distribution  of  Item  Difficulty,  AFOQT  Form  PI 


Subtest 

Number 

of  items 

witn  item 

difficulty  of 

.00  to 
.20 

.21  to 
.40 

.41  to 
.60 

.61  to 
.80 

.81  to 
.99 

Verbal  Analogies 

(VA) 

0 

3 

8 

8 

6 

Arithmetic  Reasoning 

(AR) 

0 

3 

8 

10 

4 

Reading  Comprehension 

(RC) 

0 

7 

3 

12 

3 

Data  Interpretation 

(DI) 

0 

1 

11 

9 

4 

Word  Knowledge 

(WK) 

0 

5 

7 

10 

3 

Math  Knowledge 

(MK) 

0 

0 

8 

13 

2 

Mechanical  Comprehension 

(MC) 

0 

3 

13 

4 

0 

Electrical  Maze 

(EM) 

6 

5 

4 

5 

0 

Scale  Reading 

(SR) 

0 

9 

11 

12 

8 

Instrument  Comprehension 

(1C) 

0 

2 

9 

9 

0 

Block  Counting 

(BC) 

0 

3 

6 

6 

3 

Table  Reading 

(TR) 

2 

7 

b 

3 

21 

Aviation  Information 

(AI) 

0 

8 

9 

1 

1 

Rotated  Blocks 

(RB) 

0 

4 

6 

2 

3 

General  Science 

(GS) 

0 

6 

8 

4 

1 

Hidden  Figures 

(HF) 

0 

1 

6 

4 

4 
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Table  D-3.  Distribution  of  Item  Discrimination,  AFOQT  Form  PI 


Subtest 

Number 

of  items 

with  item  biserial  of 

.00  to 
.20 

.21  to 
.40 

.41  to 
.60 

.61  to 
.80 

.81  to 
.99 

Verbal  Analogies 

(VA) 

0 

0 

16 

9 

0 

Arithmetic  Reasoning 

(AR) 

0 

0 

4 

20 

1 

Reading  Comprehension 

(RC) 

0 

1 

14 

10 

0 

Data  Interpretation 

(DI) 

0 

5 

13 

7 

0 

Word  Knowledge 

(WK) 

0 

0 

3 

22 

0 

Math  Knowledge 

(MK) 

0 

1 

3 

14 

7 

Mechanical  Comprehension 

(MC) 

0 

2 

14 

4 

0 

Electrical  Maze 

(EM) 

0 

1 

13 

6 

0 

Scale  Reading 

(SR) 

0 

10 

19 

11 

0 

Instrument  Comprehension 

(IC) 

0 

0 

4 

13 

3 

Block  Counting 

(BC) 

0 

0 

7 

12 

1 

Table  Reading 

(TR) 

0 

2 

9 

17 

12 

Aviation  Information 

(AI) 

0 

1 

8 

10 

0 

Rotated  Blocks 

(RB) 

0 

0 

6 

9 

0 

General  Science 

(GS) 

0 

2 

8 

9 

0 

Hidden  Figures 

(HF) 

0 

0 

3 

12 

0 
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Table  D-4.  Mean  Difficulties,  Biserials,  and  Point 
Biserials,  AFOQT  Form  P2 


Subtest 

Mean  item 
difficulty 

Mean 

biserial 

Mean  point 
biserial 

Verbal  Analogies 

(VA) 

.676 

.579 

.410 

Arithmetic  Reasoning 

(AR) 

.604 

.648 

.486 

Reading  Comprehension 

(RC) 

.663 

.631 

.468 

Data  Interpretation 

(DI) 

.667 

.574 

.425 

Word  Knowledge 

(WK) 

.603 

.052 

.497 

Math  Knowledge 

(MK) 

.610 

.668 

.514 

Mechanical  Comprehension 

(MC) 

.498 

.552 

.432 

Electrical  Maze 

(EM) 

.388 

.533 

.393 

Scale  Reading 

(SR) 

.563 

.482 

.3o2 

Instrument  Comprehension 

(IC) 

.599 

.740 

.572 

Block  Counting 

(BC) 

.585 

.606 

.447 

Table  Reading 

(TR) 

.690 

.b49 

.434 

Aviation  Information 

(AI) 

.476 

.607 

.4b8 

Rotated  Blocks 

(RB) 

.564 

.646 

.489 

General  Science 

(GS) 

.482 

.518 

.402 

Hidden  Figures 

(HF) 

.652 

.648 

.460 

Table  D-5 .  Distribution  of  Item  Difficulty,  AFOQT  Form  P2 


Subtest 

Number  of  items  with  item 

difficulty  of 

.00  to 
.20 

.21  to 
.40 

.41  to 
.60 

.bl  to 
.80 

.81  to 
.99 

Verbal  Analogies 

(VA) 

0 

2 

8 

7 

8 

Arithmetic  Reasoning 

(AR) 

1 

2 

9 

9 

4 

Reading  Comprehension 

(RC) 

0 

1 

9 

10 

b 

Data  Interpretation 

(DI) 

0 

1 

7 

14 

3 

Word  Knowledge 

(WK) 

0 

4 

9 

10 

2 

Math  Knowledge 

(MK) 

0 

0 

lb 

8 

2 

Mechanical  Comprehension 

(MC) 

0 

4 

10 

6 

0 

Electrical  Maze 

(EM) 

8 

3 

4 

b 

0 

Scale  Reading 

(SR) 

1 

12 

9 

12 

b 

Instrument  Comprehension 

(IC) 

0 

0 

11 

8 

1 

Block  Counting 

(BC) 

0 

b 

4 

7 

3 

Table  Reading 

(TR) 

1 

8 

4 

6 

21 

Aviation  Information 

(AI) 

0 

8 

8 

2 

1 

Rotated  Blocks 

(RB) 

0 

3 

7 

2 

3 

General  Science 

(GS) 

0 

6 

10 

2 

1 

Hidden  Figures 

(HF) 

0 

3 

4 

2 

b 

Table  D-6.  Distribution  of  Item  Discrimination,  AFOQT  Form  P2 


Subtest 

Number 

of  items 

with  item  biserial  of 

.00  to 
.20 

.21  to 
.40 

.41  to 
.60 

.61  to 
.80 

.81  to 
.99 

Verbal  Analogies 

(VA) 

0 

0 

14 

11 

0 

Arithmetic  Reasoning 

(AR) 

0 

0 

6 

19 

0 

Reading  Comprehension 

(RC) 

0 

0 

11 

14 

U 

Data  Interpretation 

(DI) 

0 

3 

10 

12 

0 

Word  Knowledge 

(WK) 

0 

0 

6 

18 

1 

Math  Knowledge 

(MK) 

0 

0 

8 

14 

3 

Mechanical  Comprehension 

(MC) 

0 

3 

10 

7 

0 

Electrical  Ma^e 

(EM) 

0 

3 

12 

5 

0 

Scale  Reading 

(SR) 

0 

11 

22 

7 

0 

Instrument  Comprehension 

(IC) 

0 

0 

3 

10 

7 

Block  Counting 

(BC) 

0 

2 

7 

11 

0 

Table  Reading 

(TR) 

0 

3 

11 

16 

10 

Aviation  Information 

(AI) 

0 

0 

7 

12 

0 

Rotated  Blocks 

(RB) 

0 

0 

4 

11 

0 

General  Science 

(GS) 

0 

4 

12 

3 

0 

Hidden  Figures 

(HF) 

0 

0 

3 

12 

0 
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APPENDIX  E 


AFOQT  Subtest  Intercorrelation  matrices 
for  Forms  PI  and  P2 
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Table  E-l .  Subtest  Raw  Score  Intercorrelations,  AFOQT  Form  PI 


VA 

AR 

RC 

DI 

WK 

MK 

MC 

EM 

SR 

IC 

BC 

TR 

AI 

RB 

GS 

AR 

.59 

RC 

.67 

.55 

DI 

.59 

.72 

.60 

WK 

.65 

.46 

.72 

.48 

MK 

.54 

.72 

.46 

.58 

.36 

MC 

.49 

.55 

.42 

.50 

.37 

.45 

EM 

.29 

.38 

.25 

.37 

.16 

.36 

.45 

SR 

.46 

.67 

.43 

.65 

.32 

.55 

.47 

.46 

IC 

.40 

.41 

.42 

.25 

.38 

.53 

.47 

.50 

BC 

.41 

.50 

.37 

.52 

.25 

.44 

.44 

.46 

.59 

.50 

TR 

.31 

.40 

.34 

.46 

.23 

.38 

.26 

.32 

.53 

.35 

.50 

AI 

.33 

.30 

.32 

.33 

.30 

.22 

.53 

.31 

.32 

.54 

.29 

.22 

RB 

.41 

.48 

.28 

.44 

.23 

.46 

.54 

.44 

.47 

.53 

.53 

.29 

.35 

GS 

.55 

.55 

.52 

.49 

.49 

.55 

.61 

.39 

.43 

.45 

.38 

.22 

.46 

.45 

HF 

.42 

.42 

.34 

.43 

.26 

.42 

.42 

.39 

.44 

.43 

.48 

.34 

.2b 

t  i 

•  J±- 

.  3o 

bb 


Table  E-2.  Subtest  Raw  Score  Intercorrelations,  AFOQT  Form  P2 


VA 

AR 

RC 

DI 

WK 

MK 

MC 

EM 

SR 

IC 

BC 

TR 

AI 

RB 

GS 

AR 

.62 

RC 

.72 

.56 

DI 

.64 

.76 

.61 

WK 

.70 

.47 

.74 

.52 

MK 

.53 

.74 

.46 

.64 

.37 

MC 

.52 

.54 

.46 

.53 

.42 

.43 

EM 

.31 

.40 

.27 

.40 

.17 

.38 

.45 

SR 

.45 

.63 

.42 

.61 

.31 

.53 

.43 

.45 

IC 

.42 

.  46 

.36 

.45 

.27 

.41 

.56 

.48 

.51 

BC 

.40 

.49 

.37 

.51 

.25 

.43 

.42 

.49 

.57 

.50 

TR 

.32 

.44 

.31 

.45 

.21 

.37 

.28 

.34 

.55 

.37 

.51 

A I 

.35 

.30 

.35 

.34 

.32 

.21 

.54 

.30 

.33 

.55 

.29 

.23 

Rd 

.40 

.49 

.30 

.47 

.25 

.46 

.54 

.45 

.47 

.56 

.53 

.32 

.35 

GS 

.56 

.55 

.53 

.52 

.54 

.51 

.65 

.40 

.42 

.49 

.37 

.24 

.49 

.4o 

HF 

.38 

.41 

.31 

.40 

.25 

.38 

.39 

.39 

.44 

.44 

.47 

.34 

.25 

.50 

.35 
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Table  F-l.  AFOQT  Form  PI  Pilot  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-38 

01 

102 

33 

13  b 

69 

3y-44 

02 

103 

34 

137 

70 

43-51 

03 

104 

35 

138 

71 

52-5o 

04 

105 

3o 

139 

75 

56-57 

05 

106 

37 

140 

74 

58-61 

06 

107 

38 

141 

75 

62-64 

07 

108 

39 

142 

76 

65-67 

08 

109 

41 

143 

77 

68 

09 

110-111 

42 

144 

78 

69-70 

10 

112 

43 

145 

79 

71-72 

11 

113 

44 

146 

80 

73-74 

12 

114 

45 

147 

81 

75-77 

13 

115 

46 

148 

02 

78 

14 

116 

47 

149 

83 

79 

15 

117 

48 

150-151 

84 

80 

16 

118 

50 

152 

85 

81-82 

17 

119 

51 

153-154 

8b 

83 

18 

120 

52 

155 

87 

84 

19 

121 

53 

156 

88 

85-87 

20 

122 

54 

157 

89 

88 

21 

123 

55 

158 

90 

89 

22 

124 

56 

159 

91 

90 

23 

125 

57 

160 

92 

91-92 

24 

126 

58 

161 

93 

93 

25 

127 

60 

lb2-163 

94 

94 

26 

128 

61 

164-165 

95 

95 

27 

129 

62 

166-168 

9b 

96-97 

28 

130-131 

o3 

lby-173 

97 

98 

29 

132 

64 

174-179 

98 

99 

30 

133 

65 

180-204 

99 

100 

31 

134 

66 

101 

32 

135 

67 

92 


Table  F-2.  AFOQT  Form  PI  Navigator-Technical  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-58 

01 

138 

34 

178 

6  7 

59-69 

02 

139 

35 

179 

b8 

70-75 

03 

140-141 

36 

180 

09 

lb-19 

04 

142 

37 

181 

70 

80-83 

05 

143-144 

38 

182 

71 

84-85 

06 

145 

39 

183 

72 

86-88 

07 

14b 

40 

184-185 

73 

89-91 

08 

14  7 

41 

lbb-lo7 

74 

92-95 

09 

148 

42 

188 

7  3 

96-97 

10 

149-151 

43 

189 

7  o 

98-99 

11 

152 

44 

190 

11 

100-102 

12 

153 

45 

191 

76 

103-104 

13 

154 

4b 

192-193 

79 

105-106 

14 

155 

47 

194 

80 

107-109 

25 

156 

48 

195-19b 

8l 

110-111 

16 

157 

49 

197 

82 

112-113 

17 

158 

50 

198-199 

8  3 

114-116 

18 

159 

51 

200 

84 

117 

19 

160-161 

52 

201 

b5 

118-119 

20 

162 

53 

202-203 

86 

120-121 

21 

163 

54 

204-205 

8/ 

122 

22 

164 

55 

206-207 

88 

123-124 

23 

165 

5b 

208-209 

89 

125 

24 

166 

57 

210-211 

90 

126-127 

25 

167 

58 

212-213 

91 

128 

26 

168 

59 

214 

92 

129 

27 

169 

60 

215-216 

93 

130 

28 

170 

Dl 

217-219 

131-132 

29 

171 

o2 

220-222 

93 

133-134 

30 

172-173 

63 

223-227 

96 

135 

31 

174 

64 

228-232 

97 

136 

32 

175-176 

65 

233-236 

98 

137 

33 

177 

66 

237-284 

99 

hi 


Table  F-3.  AFOQT  Form  Pi  Academic  Aptitude  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-27 

01 

82 

29 

112 

69 

28-34 

02 

83 

31 

113 

70 

35-39 

03 

84 

33 

114 

71 

40-41 

04 

85 

34 

115 

72 

42-45 

05 

86 

35 

116 

75 

46-48 

06 

87 

36 

117 

76 

49-50 

07 

88 

37 

118 

78 

51 

08 

89-90 

38 

119 

79 

52-55 

09 

91 

40 

120 

80 

56-57 

10 

92 

41 

121 

81 

58-59 

11 

93 

43 

122 

82 

60 

12 

94 

44 

123 

83 

61 

13 

95 

45 

124 

84 

62 

14 

96 

47 

125 

85 

63 

15 

97 

49 

l2t> 

86 

64-66 

16 

98 

50 

127 

8  7 

67 

17 

99 

51 

128 

88 

68-69 

18 

100 

52 

129 

89 

70 

19 

101 

53 

130 

90 

71 

20 

102-103 

54 

131 

91 

72-73 

21 

104 

57 

132 

92 

74 

22 

105 

59 

133-134 

93 

75 

23 

106 

61 

135 

94 

76 

24 

107 

62 

13b-137 

95 

77 

25 

108 

63 

138-139 

96 

78 

26 

109 

65 

140-141 

97 

79 

27 

110 

67 

142-143 

98 

80-81 

28 

111 

68 

144-150 

99 

94 


Table  F-4.  AFOQT  Form  PI  Verbal  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-14 

01 

34 

23 

53 

b2 

15 

02 

35 

24 

54 

64 

16-17 

03 

36 

26 

55 

b7 

18 

04 

37 

27 

56 

b9 

19 

05 

38 

30 

57 

72 

20 

06 

39 

32 

58 

74 

21 

07 

40 

33 

59 

11 

22 

08 

41 

3b 

60 

78 

23 

09 

42 

38 

bl 

81 

24 

10 

43 

40 

62 

84 

25 

11 

44 

41 

63 

8b 

26 

12 

45 

44 

b4 

87 

27 

13 

46 

4b 

b5 

90 

28 

14 

47 

48 

6b 

92 

29 

15 

48 

50 

b7 

93 

30 

17 

49 

53 

68 

9b 

31 

18 

50 

55 

69 

97 

32 

19 

51 

57 

70 

98 

33 

21 

52 

60 

71-75 

99 

95 


Table  F-5.  AFOQT  Form  Pi  Quantitative  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-14 

01 

39 

28 

60 

71 

15-16 

02 

40-41 

31 

61 

75 

17-19 

03 

42 

33 

62 

7b 

20 

04 

43-44 

34 

63 

78 

21 

05 

45 

38 

64 

80 

22 

06 

46 

41 

65 

82 

23-24 

08 

47-48 

43 

66 

85 

25 

09 

49 

45 

67 

86 

26 

10 

50 

48 

68 

90 

27-28 

11 

51-52 

52 

69 

91 

2y 

14 

53 

54 

70 

92 

30 

15 

54 

57 

71 

94 

31-32 

17 

55 

59 

72 

95 

33 

19 

56 

61 

73 

97 

34-35 

21 

57 

64 

74 

98 

36 

24 

58 

66 

75 

99 

37-38 

26 

59 

69 

96 


Table  F-6.  AFOQT  Form  P2  Pilot  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-41 

01 

101 

33 

134 

69 

42-46 

02 

102 

34 

135 

70 

47-53 

03 

103 

35 

13o 

71 

54-57 

04 

104 

3b 

137 

73 

58-59 

05 

105 

37 

138 

74 

60-62 

06 

106 

38 

139 

73 

63-65 

07 

107 

39 

140 

7b 

66-67 

08 

108 

41 

141 

77 

68 

09 

109 

42 

142-143 

76 

69-70 

10 

110 

43 

144 

79 

71-73 

11 

111 

44 

143 

80 

74-75 

12 

112 

45 

14  b 

61 

76-77 

13 

113 

4o 

147 

82 

78 

14 

114 

47 

148 

63 

79 

15 

115 

48 

149-150 

64 

80 

16 

116 

50 

151 

85 

81-82 

17 

117 

51 

152-153 

66 

83 

18 

118 

52 

134 

67 

84 

19 

119-120 

53 

155 

88 

85-86 

20 

121 

54 

13  b 

89 

87 

21 

122 

55 

13/ 

90 

88 

22 

123 

56 

138 

91 

89 

23 

124 

57 

159 

92 

90-91 

24 

125 

58 

ibO 

93 

92 

25 

126 

60 

161-162 

94 

93 

26 

127 

61 

163-164 

95 

94 

27 

128 

b2 

lb3— 108 

90 

95-96 

28 

129 

63 

169-173 

97 

97 

29 

130 

64 

174-179 

96 

98 

30 

131 

b5 

180-204 

99 

99 

31 

132 

66 

100 

32 

133 

67 

97 


Table  F-7.  AFOQT  Form  P2  Navigator-Technical  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

P^aw  score 

Percentile 

0-59 

01 

135 

34 

174 

67 

60-69 

02 

136 

35 

175 

68 

70-74 

03 

137 

36 

17b 

b9 

75-79 

04 

138 

37 

177 

70 

80-82 

05 

139-140 

38 

178 

71 

83-84 

06 

141-142 

39 

179 

72 

85-87 

07 

143 

40 

180-181 

73 

88-90 

08 

144 

41 

182 

74 

91-93 

09 

145 

42 

183-184 

75 

94-95 

10 

146-147 

43 

185 

76 

96-98 

11 

148 

44 

186 

77 

99-100 

12 

149 

45 

187 

78 

101-102 

13 

150 

46 

188-189 

79 

103-104 

14 

151 

47 

190 

80 

105-106 

15 

152 

48 

191-192 

81 

107-109 

16 

153 

49 

193 

82 

110-111 

17 

154-155 

50 

194-195 

83 

112-113 

18 

156 

51 

196 

84 

114 

19 

157 

52 

197 

85 

115-116 

20 

158 

53 

198 

86 

117-119 

21 

159 

54 

199-201 

87 

120 

22 

160 

55 

2U2-203 

88 

121 

23 

161 

56 

204-205 

89 

122 

24 

162 

57 

206-207 

90 

123-124 

25 

163 

58 

208-209 

91 

125 

26 

164 

59 

210 

92 

126 

27 

165 

60 

211-212 

93 

127 

28 

166 

61 

213-215 

94 

128 

29 

167 

62 

216-219 

95 

129-131 

30 

168-169 

63 

220-223 

96 

132 

31 

170 

64 

224-228 

97 

133 

32 

171-172 

65 

229-232 

98 

134 

33 

173 

6b 

233-264 

99 

98 


Table  F-8.  AFOQT  Form  P2  Academic  Aptitude  Composite  Conversion  Table 


Raw  score  Percentile 


Raw  score  Percentile  Raw  score  Percentile 


0-29  01 

30-34  02 

35-39  03 

40-41  04 

42-44  05 

45-47  06 

48-49  07 

50  08 

51-54  09 

55-56  10 

57  11 

58-59  12 

60  13 

61  14 

62  15 

63-65  16 

66  17 

67-68  18 

69-70  19 

71  20 

72  21 

73  22 

74-75  23 

76  24 

77  25 

78  26 

79-80  27 

81  28 


82 

29 

83 

31 

84 

33 

85-86 

34 

87 

35 

88 

36 

89 

37 

90-91 

38 

92 

40 

93 

41 

94 

43 

95 

44 

96-97 

45 

98 

47 

99 

49 

100 

50 

101 

51 

102 

52 

103 

53 

104-105 

54 

106 

57 

107 

59 

108 

61 

109 

62 

110 

63 

111 

65 

112 

67 

113 

68 

114 

69 

115 

70 

116-117 

71 

118 

72 

119 

75 

12  u 

76 

121 

76 

122 

79 

123 

8U 

124 

81 

125 

82 

126 

83 

127 

83 

128 

66 

129 

87 

130 

68 

131 

89 

132 

90 

133 

91 

134 

92 

135-136 

93 

137 

94 

138 

95 

139-140 

96 

141-142 

97 

143 

98 

144-150 

99 

99 


Table  F-9.  AFOQT  Form  P2  Verbal  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Raw  score 

Percentile 

0-15 

01 

36 

23 

55 

62 

16 

02 

37 

24 

56 

64 

17-18 

03 

38 

26 

57 

67 

19 

04 

39 

27 

58 

69 

20 

05 

40 

30 

59 

72 

21 

06 

41 

32 

60 

74 

22 

07 

42 

33 

61 

77 

23 

08 

43 

36 

62 

78 

24 

09 

44 

38 

63 

81 

25 

10 

45 

40 

64 

84 

26 

11 

46 

41 

65 

86 

27 

12 

47 

44 

66 

87 

28 

13 

48 

46 

67 

90 

29 

14 

49 

48 

68 

92 

30 

15 

50 

50 

69 

93 

31 

17 

51 

53 

70 

96 

32-33 

18 

52 

55 

71 

97 

34 

19 

53 

57 

72 

9  8 

35 

21 

54 

60 

73-75 

99 

100 


Table  F-10.  AFOOT  Form  P2  Quantitative  Composite  Conversion  Table 


Raw  score 

Percentile 

Raw  score 

Percentile 

Kaw  score 

Percentile 

0-14 

01 

37-38 

2b 

59 

71 

15-16 

02 

39 

31 

60-61 

75 

17-18 

03 

40-41 

33 

o2 

7b 

19 

04 

42 

34 

63 

7  b 

20 

05 

43-44 

38 

b4 

bz 

21 

06 

45 

41 

00 

bo 

22-23 

08 

46 

43 

60 

8b 

24 

09 

47-48 

45 

o7 

bb 

25 

10 

49 

4b 

ob 

yu 

26-27 

11 

50 

52 

69 

91 

28 

14 

51 

54 

70 

92 

29 

15 

52-53 

57 

71 

94 

30-31 

17 

54 

59 

72 

90 

32 

19 

55 

61 

/' j 

9b 

33 

21 

56 

b4 

74 

yb 

34-35 

24 

57 

66 

/  o 

99 

36 

26 

58 

b9 
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